PREFACE



In June 1986, a remarkable conference was held in Washington, DC. This 
conference, organized by the U.S. Department of Education's Office of 
Educational Research and Improvement, was intended to bring together the 
most influential researchers in regular as well as compensatory education to 
consider what was known about effective instruction that would be of use in 
the design of programs for students at risk of school failure. After two full 
days of paper presentations and lively discussion, a most disheartening 
consensus was reached. First, there was widespread agreement that existing 
Chapter I programs were having only marginal effects on the achievement of 
students at risk. Second, there was equally wide consensus that educational 
research had few if any sure answers to the question of how compensatory 
education might be changed to make a substantial difference in the educa- 
tional success of students at risk. 

Since that conference, it has become increasingly apparent that almost 
a quarter-century after President Johnson launched Title I (now Chapter I) 
as part of his War on Poverty, researchers and practitioners familiar with 
compensatory education are questioning long-established policies and
practices, yet have few clear conceptions about what alternatives will be more
effective than current practice. Part of the problem is that the federal 
investment in research and development of programs specifically designed 
for compensatory education has been near zero for more than a decade. 
Another is that simplistic beliefs about what is required to reform compen- 
satory education have taken some time to die. For example, much of the 
discussion about design of Chapter I programs has had to do with the 
location of the services (pullout versus in-class); it has taken more than ten 
years to discover that pullout and in-class Chapter I models are equally
effective, or more accurately, equally ineffective (see Chapter 8).
Experimentation and reform in Chapter I have also been constrained by regulatory
limits and by overly restrictive readings of federal regulations by state and 
local administrators. Finally, real progress in design of compensatory 
education has been impeded by a false sense of complacency. School 
districts' Chapter I evaluations usually show healthy fall-to-spring gains in 
standardized test scores. The fact that comparable gains are made by similar
students not served by Chapter I programs has only recently been widely 
acknowledged.

Paralleling the sense of a need for new directions in compensatory
education is a similar feeling in special education. As with the pullout versus 
in-class debate in compensatory education, special education has also been 
engaged in a debate about the location of services for students with mild 
academic handicaps, a debate which has diverted attention from the quality 
of services handicapped children receive. At the same time, special
educators are becoming concerned about the 260 percent increase since 1976 in the
number of students being served under the label learning disabled or its 
equivalent, students who would never have come to the attention of special
education fifteen years ago. 

Both compensatory and special education are at a watershed. Both are
dissatisfied with current practice and are ready for change, yet neither has a
clear direction for the future. 

This book was written to provide the best available information on 
what is known now about effective programs for students at risk of school 
failure, particularly those who are currently served in compensatory and
special education programs. The message of this book is that we know much 
more than we are currently using in programs for students at risk, and that 
while much more remains to be learned, we know how to proceed to 
discover how best to prevent and remediate learning deficits. We know that 
the tragic progression of events that begins with poor achievement in the 
elementary grades is not an inevitable consequence of low socioeconomic 
background, poor socialization, or inadequate skills at school entry; we 
know that well-designed school programs can keep students from starting in 
that descending spiral. Reform of compensatory and special education to 
ensure all students an adequate level of basic skills in the early grades will 
require a major restructuring, not fine-tuning, of existing programs. As much 
as the need for restructuring is becoming apparent to educators, the 
discussion of how to proceed is only beginning. This book helps to lay the 
empirical and intellectual groundwork for the changes that must come in 
programs for students at risk of school failure. 

Preparation of this volume was supported by a grant from the Office of 
Educational Research and Improvement, U.S. Department of Education, to 
the Center for Research on Elementary and Middle Schools at Johns 
Hopkins University (No. OERI-G-86-0006). In particular, we would like to 
thank Rone Gonzales, our project monitor, for suggesting and then
encouraging this work. However, the opinions expressed are those of the authors,
and do not necessarily represent OERI positions or policy.



STUDENTS AT RISK
OF SCHOOL FAILURE: 

THE PROBLEM 
AND ITS DIMENSIONS 

Robert E. Slavin 

Center for Research on Elementary and Middle Schools 
The Johns Hopkins University 



Virtually every child is capable of attaining an adequate level of basic skills. 
The recognition of this fact is an essential starting point for a discussion on 
education for students at risk of school failure. For example, there is little 
doubt that if students had highly skilled one-to-one tutors as long as they 
needed them, all but the most seriously dyslexic or mentally handicapped 
children (no more than 1 to 2 percent of the population) would learn enough 
reading and mathematics in the elementary grades to serve as a basis for 
success in the later grades. Yet, according to the National Assessment of 
Educational Progress (1985), almost a quarter of all seventeen-year-olds in 
school read below a level considered necessary to read simple popular 
magazines. Add to this percentage the approximately 14 percent of students 
who have dropped out by age seventeen and it becomes apparent that the 
problem of functional illiteracy is enormous in scope. 

The fact that a substantial proportion of students fail to attain an 
adequate level of basic skills says nothing at all about the capacity of these 
children to learn; rather, it says that given the level of resources we are able 
or willing to commit to the education of all students and the way in which 
these resources are used in practice, a certain proportion of students will not
succeed. While it is certain that virtually all students can learn, instructional 
methods and materials now in use are failing large numbers of students. 
Even under optimal instructional conditions, some students will require 
more resources, more time, or both to achieve an acceptable level of 
achievement. 

The problem of low achievement among huge numbers of students is 
hardly new, and in some ways it is improving. For example, although the 
high school dropout rate has remained at about 14 percent for whites over
the period from 1970 to 1985, for blacks it has diminished from 31 percent to
19 percent (U.S. Department of Commerce, 1986). The National Assess- 
ment of Educational Progress shows improvements in the reading scores of 
blacks and Hispanics over the past decade (Carroll, 1987). Yet there is still a 
long way to go. International comparisons consistently find U.S. students to 
rank among the lowest in literacy and mathematical skills among industrial- 
ized nations, and minority students still perform substantially below their 
white classmates. 

While the problem of low achievement is not new, its consequences
are becoming more serious. The U.S. economy no longer has
large numbers of jobs for workers lacking basic skills. Recent studies of 
cities experiencing very high growth rates find that even when entry-level 
jobs (such as fast-food jobs) are plentiful, there is a substantial core of 
workers who cannot qualify for them because of poor basic skills. Allowing 
large numbers of disadvantaged students to leave school with minimal skills 
ensures them a life of poverty and dependence, the consequences of which
are disastrous to the social cohesiveness and well-being of our nation. Yet 
the problem of low achievement is by no means restricted to poor or 
minority students; more than 10 percent of advantaged students lack the 
ability to read popular magazines, and only half have the reading skills 
considered necessary to read most newspaper stories or popular novels 
(NAEP, 1985). 

As many economists have pointed out, if the United States is to
compete successfully in the world market, we must work smarter, not just 
harder. Real and lasting improvements in the standards of living of nations 
only come about through increases in the productivity of the workforce. As 
a nation, we cannot afford to continue to allow our school systems to turn
out students lacking in the skills necessary to become productive citizens.
We cannot afford to allow children to start out on a path that begins with
poor achievement and leads to truancy, behavior problems, delinquency, 
early pregnancy, and dropout. The economic costs, not to mention the social 
costs, of allowing this progression to unfold for so many students are 
intolerable. The negative spiral that begins with poor achievement in the 
early grades can be reversed. We know we can guarantee virtually all 
children adequate basic skills in the elementary school, and if we choose to 
do so, there is every likelihood that we could dramatically increase the
school success of large numbers of students and consequently the quality of 
life of our society. 



DEFINING "AT RISK" 

Recently, educators have begun to use the term at risk to describe a certain 
category of students. The meaning of this term is never very precise, and
varies considerably in practice. One possible definition is that students who
are at risk are those who, on the basis of several risk factors, are unlikely to
graduate from high school. Among these risk factors would be low
achievement, retention in grade, behavior problems, poor attendance, low
socioeconomic status, and attendance at schools with large numbers of poor
students. All of these factors are closely associated with dropping out of 
school, and research has found that by the time students are in the third grade,
we can fairly reliably predict which students will ultimately drop out and 
which will complete their schooling (Howard and Anderson, 1978; Lloyd, 
1978; Kelly, Veldman, and McGuire, 1964). In practice, however, different
factors have different predictive value depending on student age and other 
variables. For example, in looking at preschool students, the best predictors 
of dropout and other school problems are socioeconomic status indicators
(Schreiber, 1968). As students move through the grades, their actual 
performance in school becomes a much better predictor; grades, attendance, 
and retentions of sixth graders, for example, are very highly predictive of 
dropout (Lloyd, 1974). 

The probability that a student will complete high school is not the only 
rational criterion for designating students as being at risk. For instance, we 
might define as at risk those students who are unlikely to leave school (at 
whatever age) with an adequate level of basic skills. With the increasing use
of competency-based graduation requirements, we might define as at risk 
those students who are unlikely to pass criterion-referenced graduation 
tests. And, as still another possibility, we might use criteria such as being 
at risk for failing one or more grades, or being assigned to special educa- 
tion. 

Each of these definitions would produce a somewhat different set of 
students held to be at risk, but there would be considerable overlap among 
them. The group we are focusing on is those students whose intelligence is 
within normal limits but who are failing to achieve the basic skills necessary 
for success in school and in life. A practical definition of at risk might be 
those students who are presently eligible for special or compensatory 
education. 

This book focuses on programs for students who are at risk by any
of the definitions discussed above, with one major exception: students 
who might be considered at risk solely on the grounds that they speak a 
language other than English. (For discussions of issues related to the 
education of students with limited English proficiency, see Carter and 
Segura, 1979; Ramirez, 1986; Willig, 1985; Wong-Fillmore and Valadez,
1986.)

What do we know now about effective programs for students who are
at risk for school failure? What do we need to know to better serve these
students? These are the central questions posed in this book and addressed
by the various chapters.



TYPES OF PROGRAMS FOR STUDENTS
AT RISK 

There are three major types of programs for students at risk: compensatory
or other remedial programs, special education programs, and general 
education programs. Each of these has its own history, legal requirements, 
and practical constraints. 

Chapter I/Title I

Compensatory education refers primarily to federal programs targeted 
toward low-achieving, disadvantaged students. The largest compensatory 
program by far is Chapter I (formerly Title I), currently budgeted at $4.3 
billion per year. One in every nine students receives Chapter I services, a
total of almost five million children (Guttman and Henderson, 1987). More 
than 90 percent of school districts receive Chapter I funds (Kennedy, Jung, 
and Orland, 1986). Chapter I funds are given to schools on the basis of the 
number of low-income students they serve, but within schools they are used 
to serve students according to their educational needs, not their poverty 
level. Because of this, and because nonpoor students so outnumber poor 
ones, the majority (58 percent) of students receiving Chapter I services are
not themselves from families in poverty (see Figure 1-1, from Kennedy,
Jung, and Orland, 1986). However, poor students are disproportionate
recipients of Chapter I services, as are black and Hispanic students. Figure
1-1 also shows that the great bulk of Chapter I services go toward
elementary schools (grades K-8). Most Chapter I funds provide
instructional services to students in reading, mathematics, and/or language, as
illustrated in Figure 1-2 (from Kennedy, Birman, and Demaline, 1986).

For further discussion of the history and purpose of Chapter I/Title I,
see Chapter 6.

Models of Chapter I Service Delivery. Two guiding principles of 
delivery of Chapter I services are that only eligible low-achieving students
may benefit from these services, and that the services must supplement, not
supplant, local educational efforts. The first of these, which typically limits
the use of Chapter I funds to students who score below a certain cutoff score
on standardized tests (e.g., below the 40th percentile), keeps most schools
from using Chapter I funds to improve the school overall, for example by
reducing class size or implementing more effective practices in the school as
a whole (the exception is schoolwide projects, described below). The
"supplement, not supplant" requirement generally keeps schools from using
Chapter I funds to provide services that non-Chapter I students receive out
of local funds. For instance, a district could not provide preschool or 
summer school programs for low-achieving or disadvantaged students out of
Chapter I funds if it also provided similar programs for non-Chapter I
students out of local funds. A small army of state regulators audit Chapter I
programs to make sure that funds are spent only on eligible students and that
they supplement local efforts.

FIGURE 1-1. Percent of Title I/Chapter I Students and All Students with Various
Characteristics

Figure reads: "Among Title I students in 1976-77, 42 percent were poor. Among the
student population in general in 1976-77, 21 percent were poor."

Source: M. M. Kennedy, R. K. Jung, and M. E. Orland, Poverty, achievement, and the
distribution of compensatory education services (Washington, DC: Office of Educational
Research and Improvement, U.S. Department of Education, 1986), p. 7t.

There are five principal models of service delivery used under Chapter
I funding: pullout, in-class, add-on, replacement, and schoolwide. In
pullout, students are taken out of their homeroom classes for thirty- to
forty-minute periods, during which time they receive remedial instruction in
a subject with which they are having difficulty, usually from a certified
Chapter I teacher and usually in a class of eight or fewer pupils. In in-class
models, the teacher (or, more commonly, an instructional aide) works with
eligible students within the regular classroom setting. Add-on programs
provide services outside of the regular classroom, as in summer school or
after-school programs. (An increasingly popular option, using Chapter I
funds to provide pre-kindergarten programs or to extend kindergarten to a
full day, might also be considered an add-on model.) Replacement models
involve placing Chapter I students in self-contained classes in which they
receive most or all of their instruction. These programs require school
districts to provide additional local resources to supplement Chapter I
funds. Schoolwide projects are those in which all students in a high-poverty
school can benefit from Chapter I funds. To date, schoolwide projects have
been rare, as they could be used only in schools in which at least 75 percent
of students were in poverty and in which the district was willing to provide
matching funds to supplement the Chapter I allocations. New federal
legislation has removed the matching fund requirement, so schoolwide
projects may become more common among high-poverty schools.

FIGURE 1-2. Percent of Chapter I Students Receiving Instructional and
Noninstructional Services,* 1983-84

Figure reads: "Seventy-five percent of all students enrolled in Chapter I programs
received supplementary reading instruction during the 1983-84 school year; 46 percent
received supplementary mathematics instruction."

* Total number of students = 4.85 million. If children receive more than one service,
they are counted in each subject in which they are enrolled.

** Includes vocational instruction and special services for handicapped students.

Source: M. M. Kennedy, B. F. Birman, and R. E. Demaline, The effectiveness of
Chapter I services (Washington, DC: Office of Educational Research and Improvement,
U.S. Department of Education, 1986), p. 2.

Although use of in-class, add-on, and replacement models has
increased in recent years, Chapter I funds still overwhelmingly provide pullout
programs. Figure 1-3 shows that in elementary schools, pullout designs are
used in 84 percent of all Chapter I reading programs and 70 percent of math
programs, more than all other models combined. Part of the reason for this
is that pullout models most clearly fulfill the "supplement, not supplant"
requirement of Chapter I regulations; in in-class models in particular, there
is always concern about the possibility that teachers or aides present in the
regular classroom will be helping ineligible as well as eligible students (see
Chapters 7, 8, and 11). 

How Effective Is Chapter I?

Chapter I has a very well defined purpose: to increase the reading and
mathematics achievement of low-achieving students within schools with
concentrations of students from families of low socioeconomic status.
Measurement of the outcomes of Chapter I services would appear to be
straightforward, but in practice this is not the case.

One major problem is that the overwhelming majority of school
districts evaluate their own Chapter I programs by comparing standardized
achievement test scores obtained from program participants in the fall to
those obtained in the spring. If the average percentile score or Normal Curve
Equivalent (NCE)* score increases, then the program is held to be
successful, because the students have gained relative to the norming sample used by
the test publishers. There is nothing wrong with this evaluation design in
theory, but in practice it has turned out that fall-to-spring evaluation designs
greatly overestimate true gains, for reasons that are only dimly understood.
One indication of this is that nationally, Chapter I students show
fall-to-spring gains of 7 to 8 percentile points, but these gains are essentially wiped
out over the summer; fall-to-fall or spring-to-spring gains average 1 to 2
percentile points at most (see Gabriel, Anderson, Benson, Gordon, Hill,
Pfannenstiel, and Stonehill, 1985). Even these gains may be overstated, as
annual gains can be influenced by such factors as entry and exit from
Chapter I, district promotion/retention policies, and other district practices
such as increasing alignment of curriculum with the objectives assessed on
the standardized tests.

* A Normal Curve Equivalent is a standard score that has a mean of 50 and a
standard deviation of about 21. NCEs resemble percentile scores but are on an
equal-interval scale.

FIGURE 1-3. Settings in Which Chapter I Reading and Mathematics Are
Provided by Public Schools, as Reported by School Principals, 1985-86

Figure reads: "Of all public elementary schools that offer Chapter I reading instruction,
principals in 28 percent report use of an in-class setting to teach Chapter I reading."

Source: B. F. Birman, M. E. Orland, R. K. Jung, R. J. Anson, G. N. Garcia, M. T.
Moore, J. E. Funkhouser, D. R. Morrison, B. J. Turnbull, and E. R. Reisner, The
current operation of the Chapter I program (Washington, DC: Office of Educational
Research and Improvement, U.S. Department of Education, 1987), p. 63.
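
Because the evaluations discussed here report gains sometimes in percentile points and sometimes in NCEs, a short sketch of how the two scales relate may be useful. It assumes the conventional NCE scaling (mean 50, standard deviation 21.06, chosen so that NCEs of 1, 50, and 99 line up with the same percentile ranks); the text itself says only "about 21". The function names are illustrative and are not taken from any of the evaluations cited.

```python
from statistics import NormalDist

NCE_MEAN = 50.0
NCE_SD = 21.06  # assumed conventional constant; the text gives "about 21"

_STANDARD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def percentile_to_nce(percentile: float) -> float:
    """Convert a percentile rank (strictly between 0 and 100) to an NCE score."""
    if not 0 < percentile < 100:
        raise ValueError("percentile must be strictly between 0 and 100")
    z = _STANDARD_NORMAL.inv_cdf(percentile / 100)
    return NCE_MEAN + NCE_SD * z


def nce_to_percentile(nce: float) -> float:
    """Convert an NCE score back to a percentile rank."""
    z = (nce - NCE_MEAN) / NCE_SD
    return 100 * _STANDARD_NORMAL.cdf(z)


if __name__ == "__main__":
    # The same eight-point percentile gain at two different starting points.
    for start, end in ((20, 28), (40, 48)):
        gain = percentile_to_nce(end) - percentile_to_nce(start)
        print(f"percentile {start} -> {end}: NCE gain of {gain:.1f}")
```

Under these assumptions, an eight-point percentile gain that starts at the 20th percentile corresponds to a larger NCE gain than the same percentile gain starting at the 40th, which is one reason averaged gains are better expressed on the equal-interval NCE scale.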

Since random assignment to Chapter I programs would probably be
illegal, the best permissible research design for evaluating Chapter I is to
compare students receiving Chapter I services to matched students who did
not receive services. This design, which is rarely used in practice, has one
major drawback, namely that there may be systematic reasons that certain
students do not receive services. For example, a student scoring at the 20th
percentile in reading who does not receive services may be located in a
non-Chapter I school, whereas a matched Chapter I student scoring at the
same level is in a school with many poor classmates. However, the control
group design does not suffer from the fall-to-spring inflation or from
confounding with the district policies, so it is the most objectively
meaningful evaluation.

The differences in conclusions depending on the type of evaluation
design are starkly illustrated by the results of an evaluation of the Oklahoma
City Chapter I program (Kimball, Crawford, and Raia, 1985). This
evaluation presented the data three different ways, as shown in Table 1-1. The
table shows that different evaluation designs lead to different conclusions.
The fall-to-spring and spring-to-spring gains indicate an extraordinarily
effective Chapter I program, as these gains are considerably greater than the
national seven- to eight-point fall-to-spring gain and one- to two-point
spring-to-spring gain. Yet comparing Chapter I students to matched control
students, the difference is less than two NCE points in reading and less than
three in math. Similarly, an evaluation in Baltimore found that Chapter I
students gained about eight NCE points from fall to spring, but so did
matched students who received no services (Granick, Quigley,
Katzenellenbogen, and Richardson, 1984). A later comparison of served and unserved
students in Baltimore also found negligible differences (Pinderhughes,
Richardson, Granick, and Katzenellenbogen, 1986).
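
The contrast among the three Oklahoma City estimates can be made concrete with a small calculation. The sketch below uses only the NCE values reported in Table 1-1; the ratio it computes is a crude illustration introduced here, not a statistic from the evaluation, and the names are hypothetical.

```python
# NCE values transcribed from Table 1-1 (Oklahoma City, grades 1-6).
ESTIMATES = {
    "fall_to_spring": {"reading": 13.0, "math": 17.1},
    "spring_to_spring": {"reading": 5.9, "math": 8.0},
    "matched_control": {"reading": 1.9, "math": 2.8},
}


def control_share(subject: str, design: str = "fall_to_spring") -> float:
    """Matched-control difference as a fraction of the gain reported by `design`.

    This is an illustrative ratio only (an assumption of this sketch): it asks
    how large the experimental-control difference is relative to the apparent
    gain under a pre-post design.
    """
    return ESTIMATES["matched_control"][subject] / ESTIMATES[design][subject]


if __name__ == "__main__":
    for design in ("fall_to_spring", "spring_to_spring"):
        for subject in ("reading", "math"):
            share = control_share(subject, design)
            print(f"{design:>16} {subject:>7}: {share:.0%} of the apparent gain")
```

On these figures, the matched-control difference amounts to roughly 15 percent of the fall-to-spring gain in reading and about 16 percent in math, which is the sense in which pre-post designs overstate program effects.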

TABLE 1-1. Oklahoma City Chapter I Evaluation,
Grades 1-6: NCEs Computed Three Ways

                                    Reading     Math
Fall-to-spring NCE gain               13.0      17.1
Spring-to-spring NCE gain              5.9       8.0
Experimental-control difference        1.9       2.8

The best national assessment of the effects of Title I is the now rather
dated Sustaining Effects Study (Carter, 1984), which compared achievement
gains made by Title I students in 1976-77 to matched "needy" students and
to a representative sample of non-needy students. Figure 1-4 shows that
Title I students did generally make greater gains in reading and math than
other needy students, but these gains were not adequate to close the gap
between Title I and non-needy students. Table 1-2 summarizes the same
data in standard deviation units. Note that in comparing Title I and matched
needy students, only in first grade did differences exceed 15 percent of a
standard deviation (roughly three NCE or percentile points).

FIGURE 1-4. Reading and Mathematics Achievement of Students Receiving and
Not Receiving Compensatory Education, Sustaining Effects Study, 1976-77

[Two line graphs, one for reading and one for mathematics, plotting vertical
scale score (300 to 650) by grade for three groups: non-compensatory education
students in Title I schools, Title I students, and needy students in
non-compensatory education schools.]

Figure reads: "The vertical scale scores of Title I first-grade students for reading and
mathematics increased more from the fall to the spring than did those of similar students
not enrolled in Title I schools, yet Title I first graders started behind regular first graders
in Title I schools who did not receive Chapter I and failed to catch up by the spring."

Source: M. M. Kennedy, B. F. Birman, and R. E. Demaline, The effectiveness of
Chapter I services (Washington, DC: Office of Educational Research and Improvement,
U.S. Department of Education, 1986), p. 34.
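
The first-grade exception can be checked directly against the Table 1-2 figures. The sketch below subtracts the gain of needy students with no compensatory education from the gain of Title I students at each grade and flags differences above 0.15 standard deviations. The values are transcribed from the table; the NCE conversion uses an assumed standard deviation of 21.06 (the text says "about 21"), and the function names are illustrative.

```python
from decimal import Decimal

# Gains in standard deviation units from Table 1-2, grades 1 through 6:
# (Title I students, needy students with no compensatory education).
GAINS = {
    "reading": [("1.79", "1.60"), ("0.85", "0.77"), ("0.64", "0.53"),
                ("0.50", "0.49"), ("0.38", "0.34"), ("0.37", "0.37")],
    "math": [("1.76", "1.40"), ("1.19", "1.04"), ("1.13", "1.03"),
             ("0.90", "0.79"), ("0.68", "0.55"), ("0.64", "0.49")],
}

THRESHOLD = Decimal("0.15")  # 15 percent of a standard deviation
NCE_SD = Decimal("21.06")    # assumed conventional NCE standard deviation


def title1_advantage(subject: str) -> dict[int, Decimal]:
    """Title I gain minus unserved needy gain, by grade, in SD units."""
    return {
        grade: Decimal(title1) - Decimal(needy)
        for grade, (title1, needy) in enumerate(GAINS[subject], start=1)
    }


def grades_exceeding(subject: str, threshold: Decimal = THRESHOLD) -> list[int]:
    """Grades where the Title I advantage is strictly greater than the threshold."""
    return [g for g, diff in title1_advantage(subject).items() if diff > threshold]


if __name__ == "__main__":
    for subject in GAINS:
        print(subject, title1_advantage(subject), grades_exceeding(subject))
    print("0.15 SD in NCE points:", THRESHOLD * NCE_SD)
```

Decimal arithmetic is used so that differences equal to the threshold are not pushed over it by floating-point error. As transcribed here, the math differences in grades 2 and 6 come out at exactly 0.15, so the statement holds when "exceed" is read strictly.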

The Sustaining Effects Study has been criticized on the basis that the 
comparison group of needy students may not have been truly equivalent. 
However, a recent reanalysis correcting for selection bias found essentially 
the same differences (Myers, 1986). 

It is of course possible that in the years since the Sustaining Effects
data were collected (1976-77), improved practices within Chapter I may
have led to markedly improved outcomes. However, in light of the
Oklahoma City and Baltimore experimental-control comparisons this seems
unlikely. These and other data point to a conclusion that Chapter I/Title I
may be effective in the early grades, but if so, the effects are small, on the
order of two to three percentile points. Perhaps this is as much as we should
expect from an expenditure of only about $500 per student per year, but it is
clear that a few more sight words or a little more speed doing arithmetic
problems will hardly change the chances that a disadvantaged youngster will
ultimately succeed in school or in life.

TABLE 1-2. Growth of Three Groups of Students Participating in the
Sustaining Effects Study, 1976-77 (Expressed in Standard Deviation
Units*)

              Representative    Title I     Needy Students
              Sample            Students    With No CE
Reading
  Grade 1         1.98            1.79          1.60
        2          .87             .85           .77
        3          .61             .64           .53
        4          .46             .50           .49
        5          .42             .38           .34
        6          .37             .37           .37
Math
  Grade 1         1.75            1.76          1.40
        2         1.24            1.19          1.04
        3         1.21            1.13          1.03
        4          .84             .90           .79
        5          .70             .68           .55
        6          .58             .64           .49

* All gains are converted to standard deviation units, using the standard
deviation of the fall scores of the representative samples.

Source: M. M. Kennedy, B. F. Birman, and R. E. Demaline, The
effectiveness of Chapter I services (Washington, DC: Office of Educational
Research and Improvement, U.S. Department of Education, 1986), p. 31.

Because of the importance of Chapter I for large numbers of students
at risk for school failure, most of the chapters in this book discuss Chapter I,
and all of the chapters have direct bearing on the design of instructional or
remedial programs that could in principle be funded by Chapter I. In
particular, Chapter I programs, as they exist today, are described in Chapter
7, and effective teaching behaviors within Chapter I pullout programs are
discussed in Chapter 9. Chapter 8 reviews research on design features of
Chapter I programs, and Chapters 2 and 3 present information on effective
classroom and pullout programs, respectively, which could be used to
provide compensatory services. 



Head Start 

Head Start is a federal compensatory program for students from age three to 
school entry. Head Start programs provide a half-day preschool setting for 
children from low-income families with activities designed to enhance their 
socioemotional and cognitive growth, and most programs also provide
health, nutrition, and/or family support services (see Zigler and Valentine,
1979; McKey, Condelli, Ganson, Barrett, McConkey, and Plantz, 1985).
Head Start and other compensatory preschool programs are discussed in
Chapters 4 and 6. 



Follow Through 

Follow Through refers to a federal funding program originally meant to 
maintain the successes of students in Head Start by building in special 
services for low-income students in the early elementary grades. However, 
in part because of limited funding, Follow Through soon became a source of 
funding for model programs designed to help the children of the poor. During 
the 1970s, Follow Through funded a massive Planned Variation Experiment, 
in which various innovative programs were developed, implemented, and 
compared to control groups and to one another in terms of effects on student 
achievement. Planned Variation is discussed in Chapters 2 and 6. Cutbacks 
in funding have reduced Follow Through to a minor role in federal 
compensatory efforts.



Special Education



Special education services have long been provided to students who have
identified handicaps. Since the passage of Public Law 94-142 in 1975, school 
districts have provided a continuum of services for handicapped students, 
ranging from special schools to special classes within regular schools to 
various part-time placements. In these programs, students typically receive 
instruction in very small groups from teachers with certification in special 
education. Eligibility for special education depends on assessments of 
individual students' levels of functioning, and a variety of procedural and 
legal safeguards provided for in PL 94-142 are intended to ensure that 
students receive appropriate services in the "least restrictive environment" 
(see Chapter 6). 

One of the most important trends in recent years relating to the subject 
of this book is a substantial increase in the number of students with mild 
academic handicaps who are receiving special education services. Table 1-3 
shows that although the percentage of students categorized as physically 
disabled and mentally retarded has stayed at about the same level over the 
period 1976-1985, the number of students categorized as learning disabled 
increased by about 260 percent. Almost 90 percent of this increase repre- 
sents the entry into the special education system of low achievers who 
would not have been served in special education in the past. In other words, 
special education has assumed a substantial burden in trying to meet the 
needs of students at risk of school failure. Yet research comparing students 
with mild academic handicaps in special education to similar students left in 
regular classrooms finds few if any benefits for this very expensive service 
(see Leinhardt and Pallay, 1982; Madden and Slavin, 1983). Special education 
for students at risk is discussed in Chapter 6, and effective instructional 
practices in the regular classroom for mainstreamed academically handicapped 
students are discussed in Chapter 10. 

TABLE 1-3. Percent of Children Served in Educational Programs for the 
Handicapped 

                                        Percent of Children Aged 3-21 Served 
Type of Handicap                          1976-77    1980-81    1984-85 

All conditions                              8.33      10.11      11.08 
Learning disabled                           1.80       3.57       4.67 
Mentally retarded                           2.16       2.02       1.82 
Emotionally disturbed                       0.64       0.85       0.95 
Speech impaired                             2.94       2.85       2.87 
Other (deaf, blind, health impaired)        0.81       0.83       0.78 

Source: Adapted from U.S. Department of Education, Office of Special Education 
and Rehabilitative Services, Eighth Annual Report to Congress on the 
Implementation of the Education of the Handicapped Act, 1986. 

General Education Programs for Students at Risk 

Often ignored in discussions of programs for students at risk is the fact that 
many programs and practices typical of general education have important 
consequences for students who are at risk of school failure. For example, 
tracking and other forms of between-class ability grouping are often held to 
be beneficial for low achievers by putting success within their reach 
(although research generally fails to support this; see Slavin, 1987a). Most 
successful innovations in classroom practices or school organization have 
positive effects on low- as well as average- and high-achieving students. A 
major goal of education is to bring all students to an acceptable level of 
achievement. Therefore, if certain instructional methods had important 
positive effects on the achievement of all students, including low achievers, 
these methods could be justified for use as means of serving the needs of 
students at risk. For example, continuous-progress programs and coopera- 
tive learning programs tend to be beneficial for all students, not just low 
achievers (see Chapter 2). Research generally finds that teacher behaviors 
that are successful with low achievers tend to be very similar to those 
successful with all students (see Chapters 6, 9, and 10). Thus it is likely that 
if programs focusing on improving teachers' general instructional skills are 
successful with low achievers, they will also be effective with other 
students. 

On the other hand, there are general instructional strategies that may 
differentially affect high and low achievers. For instance, there has been 
much discussion of the potential "Robin Hood" effect of group-based 
mastery learning (see Arlin, 1984; Slavin, 1987b), in which instructional time 
is taken away from high achievers to be used to provide corrective 
instruction to students who initially fail to master a given set of objectives. 
Another example of differential effects is departmentalization. Becker (1987) 
found that having sixth-grade teachers specialize in one subject (so that 
students had different teachers for each subject) had a somewhat positive 
effect on the achievement of students from families high in socioeconomic 
status, but had a marked negative effect on low-SES students. 

Discussions of potentially effective practices for students at risk have 
often been limited in that they focus on only one form of service, such as 
Chapter I, special education, preschool, or effective classroom programs for 
low achievers. However, rather than focusing entirely on pieces of the 
solution, we need to keep clearly in mind that there is only one problem to be 
solved: the problem of low achievement. Although it is true that this one 
problem has many facets, it is critical to step back from current practice to 
ask what should be the response of the educational system overall to 
ensuring all students an adequate level of basic skills in the elementary 
grades. To restate the assertion made at the beginning of this chapter, it is 
certain that virtually all students can learn. But what kinds of programs and 
what allocations of resources will most effectively bring about a state in 
which far more students will be successful in their early schooling? To 
answer this, we must think beyond the confines of current practice. We must 
learn what we can from the programs that exist today, but never forget that 
these programs are products of the prevailing conditions when they were 
formulated, and can in principle be altered if we find other programs or 
patterns of resource allocation to be more effective. It is certain that if we 
continue to do what we are doing now, we will continue to experience the 
same results, and it is equally certain that the results we are obtaining now 
are not enough. If we are to use the schools to make a significant change in 
the lives of students who are on a path leading toward school failure, we 
need to make substantial changes in regular, compensatory, and special 
education. This book is intended to provide an empirical and conceptual 
basis for the changes that must come in programs for students at risk. 

In recent years, there has developed a growing dissatisfaction with the 
educational services provided to students who are at risk for developing 
learning problems or school failure. Much of this dissatisfaction has cen- 
tered on the practice of pulling low-achieving students out of class for 
remedial instruction, usually provided as part of the federal Chapter I 
(formerly Title I) program (see Chapters 6-8). Some researchers have found 
that the more time students spent in pullout programs, the less they learned 
(Coulson, Hanes, Ozenne, Bradford, Doherty, Duck, and Hemenway, 1977; 
Glass and Smith, 1977). Chapter I pullouts have been criticized on the basis 
that instruction in the pullout program is rarely well integrated with that 
provided by the regular classroom teacher (Kaestle and Smith, 1982; 
Johnston, Allington, and Afflerbach, 1985). Also, much time is lost in 
transitions between regular and pullout settings, and pullouts rarely increase 
the total instruction provided to students (Vanecko, Ames, and Archambault, 
1980; Kimbrough and Hill, 1981). Similar problems of coordination 
and supplanting of regular classroom instruction have been noted for special 
education pullouts (Sargent, 1981), and when students qualify for both special 
education and remedial services, these problems multiply (Birman, 1981). 

Although the problems of pullouts are well recognized, alternatives to 
pullouts have their own drawbacks. Having the remedial teacher work in a 
corner of the regular classroom, as is typical in in-class Chapter I models, 
provides no guarantee that coordination with the regular program will be 
enhanced (see Lee and Rowan, 1986). In-class models are often structurally 
identical to pullout programs except that the remedial services are provided 
in a different location. Research comparing pullout and in-class Chapter 
I/Title I models has found few differences in achievement effects (see 
Chapter 8). 

One point on which there is agreement among virtually all reviewers of 
research on Chapter I and special education programs for students at risk is 
that program quality is far more important than the setting in which the 
program is implemented (see Chapters 6 and 8; Leinhardt and Pallay, 1982; 
Madden and Slavin, 1983; Archambault, 1987). Clearly, there can be 
effective pullout programs and effective in-class programs to meet the needs 
of at-risk students. This chapter examines research on classroom programs 
for elementary school students who are at risk for learning problems. The 
full range of alternative classroom organization models designed to meet the 
needs of low-achieving or heterogeneous classes is explored; the principal 
question this chapter asks is: How can the educational needs of all students 
be met by fundamentally restructuring the regular classroom, as opposed to 
adding on services outside of the regular classroom? 



This chapter focuses on a review of specific, comprehensive programs that 
may be beneficial for the achievement of students who are at risk for 
learning problems. A program is defined here as a set of procedures 
intended to be implemented as a total package and capable of being 
replicated by others. These features distinguish a program from a set of 
variables. For example, there is evidence that such variables as strong 
principal leadership, high expectations, and high time-on-task are related to 
student achievement gains (e.g., see Brophy and Good, 1986). However, 
these variables do not in themselves constitute a program; a program would 
lay out a specific set of procedures that might incorporate these variables 
(and others) but would do so in a structured and replicable format. The word 
replicable is particularly important in the definition of a program. It is often 
the case that a particular school or district has considerable success with 
at-risk students, but the success is due to the unusual skill or devotion of a 
set of individual principals or teachers rather than to any particular set of 
practices that can be easily transmitted to others. 



This review used a set of procedures adapted from best-evidence synthesis 
(Slavin, 1986), a review method that combines features of meta-analytic and 
traditional narrative reviews. In essence, a priori inclusion criteria relating 
to internal and external validity were established and a broad literature 
search was conducted to identify studies that met those criteria. When 
possible, achievement effects of the qualifying studies were characterized in 
terms of effect size, the difference between experimental and control means 
divided by the control group's standard deviation (Glass, McGaw, and 
Smith, 1981). However, the strengths and weaknesses of the individual 
evaluations are described to set the effects in context. These procedures are 
described in more detail in the following sections. 
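
The effect size defined above can be illustrated with a minimal sketch. The function name and the score lists are hypothetical, chosen only to show the arithmetic, and the sketch assumes that the control group's sample standard deviation serves as the divisor.

```python
from statistics import mean, stdev


def effect_size(experimental, control):
    """Difference between experimental and control means, divided by
    the control group's (sample) standard deviation."""
    return (mean(experimental) - mean(control)) / stdev(control)


# Hypothetical posttest scores, for illustration only.
experimental_scores = [52, 58, 61, 49, 55]
control_scores = [48, 50, 54, 46, 52]

print(round(effect_size(experimental_scores, control_scores), 2))
```

Dividing by the control group's standard deviation, rather than a pooled value, keeps the yardstick unaffected by any change the treatment itself may produce in the spread of scores.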

Literature Search 

Every effort was made to search the literature on instructional programs for 
elementary-aged students (grades 1-6) as broadly as possible. Academic 
journals yielded a few relevant studies, but research on comprehensive 
instructional programs is rarely reported in these journals. The most fruitful 
source of information by far was reports submitted to the Joint Dissemination 
Review Panel (JDRP), a U.S. Department of Education panel that 
reviews evaluations of programs originally supported by federal funds. 
Programs whose effects are certified as valid by the JDRP are eligible for 
funding and dissemination through the National Diffusion Network (NDN). 
What makes the JDRP submissions such a rich source of information for 
identifying effective programs is that, with few exceptions, the programs 
submitted to the JDRP were designed to be replicable. The main reason for a 
school district or research and development organization to go through 
JDRP review is to obtain funds to disseminate a program nationally. Thus 
data on programs that could not be easily transported to a new location 
would be unlikely to be submitted to the JDRP. Also, the JDRP review 
process demands enough data to enable reviewers to determine program 
effects. The submissions for more than 300 programs which successfully 
passed JDRP review were obtained from the Department of Education. 
Also, the Northwest Regional Education Laboratory recently published an 
Effective Compensatory Education Sourcebook (Griswold, Cotton, and 
Hansen, 1986) which listed 116 district Chapter I programs nominated and 
selected as being particularly effective. Data and program descriptions were 
requested from these districts. Finally, ERIC documents, journal articles, 
and other reports were obtained in a general literature search relying in 
particular on references from other reviews (e.g., Ellson, 1986). 

Substantive Inclusion Criteria 

In order to be considered germane to the scope of this review, studies had to 
conform to a broad set of criteria: 

1. The programs evaluated had to be directed toward increasing the 
reading and/or mathematics achievement of students in grades 1-6. 
Programs that were designed primarily for use in kindergarten are 
discussed in Chapter 5, whereas those that overlapped the elementary 
grades (e.g., K-2, K-6, K-12) are included here. 

2. The programs had to be implemented in regular classrooms. This 
excludes self-contained special education programs. Pullout and tutoring 
programs are reviewed in Chapter 3. 

3. The programs had to be applicable to the education of at-risk students. 
This criterion was interpreted very broadly. Even if programs did not 
present evidence that they had been successfully evaluated in schools 
containing disproportionate numbers of at-risk students, they were 
included if they could, in principle, be applied to such students. 
However, a lack of data on effects for at-risk students is noted as a 
drawback in descriptions of effective programs. 

4. Programs specifically designed for non-English speakers are not in- 
cluded in this review. 



Methodological Inclusion Criteria 

Unfortunately, the methodological quality of most evaluations of instruc- 
tional programs is very low. Many evaluations compare the achievement of 
students in experimental programs to that of students in "comparable" 
schools, with no evidence given that the comparison schools were in fact 
comparable before the treatments were administered. Others lack even this 
much control, claiming that since students achieved above "grade level 
expectations," the program must be successful. 

Perhaps the most common reason for invalid evaluation design is the 
use of fall-to-spring gains in percentiles or, more commonly, normal curve 
equivalents (NCEs). A normal curve equivalent is a standard score with a 
mean of 50, a standard deviation of about 21, and a range of 1 to 99. Thus 
NCEs are similar to percentile scores, but they are preferable because they 
use an equal-interval scale. In recent years, districts receiving Chapter I 
funds have been encouraged to report fall-to-spring gains in NCEs for their 
Chapter I recipients, which helps explain the popularity of this design. 
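
The relationship between percentile ranks and NCEs can be sketched as follows. This is an illustration rather than a procedure taken from any of the evaluations reviewed: it assumes the conventional scaling constant of 21.06 (the "about 21" cited above) and normally distributed scores, and the function name is hypothetical.

```python
from statistics import NormalDist

NCE_MEAN = 50.0
NCE_SD = 21.06  # conventional constant, assumed here; "about 21" in the text


def percentile_to_nce(percentile):
    """Convert a percentile rank (between 0 and 100, exclusive) to an NCE,
    assuming a normal distribution of scores."""
    z = NormalDist().inv_cdf(percentile / 100.0)
    return NCE_MEAN + NCE_SD * z


# NCEs and percentiles agree at 1, 50, and 99 but diverge in between.
for p in (1, 25, 50, 75, 99):
    print(p, round(percentile_to_nce(p), 1))
```

Because NCEs are a linear function of the underlying z-score, a gain of a given number of NCEs represents the same distance anywhere on the scale, which is not true of a gain in percentile points.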

The rationale behind the fall-to-spring design is sensible in theory. If 
students score at, say, a mean NCE of 40 in the fall and 50 in the spring, then 
they have apparently gained relative to the national norming group, which 
should have remained at the same level on average. However, in recent 
years it has become apparent that this assumption is incorrect. Among 
Chapter I populations, NCE gains of about eight points are routinely made 
from fall to spring (see Gabriel, Anderson, Benson, Gordon, Hill, Pfannenstiel, 
and Stonehill, 1985; Keesling, 1984). However, by the following fall, 
these differences have generally disappeared. The fall-to-spring gain is 
probably a statistical artifact, not a true effect of Chapter I. For example, 
Chapter I-eligible students who do not actually receive Chapter I services 
make similar gains (Carter, 1984; Granick, Quigley, Katzenellenbogen, and 
Richardson, 1984). Almost all of the "exemplary" Chapter I programs cited 
by Griswold, Cotton, and Hansen (1986) that provided multiyear data 
showed a pattern of fall-to-spring gains followed by equal spring-to-fall 
declines; the fall-to-fall or spring-to-spring gains were essentially nil over as 
many as five years. National data reported by Gabriel et al. (1985) indicate 
that the fall-to-spring effect is largest in the younger grades, diminishing 
from almost 9 NCEs in grade 2 to 4.5 in grade 12. 
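
The pattern described above, in which a fall-to-spring gain is cancelled by an equal spring-to-fall decline, can be made concrete with a short sketch. The scores are hypothetical mean NCEs for a single cohort, not data from any of the studies cited, and the function name is an assumption for illustration.

```python
def nce_gains(fall, spring, next_fall):
    """Return the three gains discussed in the text for one cohort's
    mean NCE scores at three successive testing points."""
    return {
        "fall_to_spring": spring - fall,
        "spring_to_fall": next_fall - spring,
        "fall_to_fall": next_fall - fall,
    }


# Hypothetical cohort: an 8-NCE gain by spring that is gone by the next fall.
print(nce_gains(fall=40, spring=48, next_fall=40))
```

Only the fall-to-fall (or spring-to-spring) figure reflects growth relative to the norming group over a full year.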

Unfortunately, the cause of the fall-to-spring artifact is unknown (see 
Kennedy, Birman, and Demaline, 1986), and it is unlikely that the gains are 
equal across tests, types of students, or other factors. Therefore, it is not 
even possible to assume that a particular district's report of gains much 
greater than 8 NCEs is evidence of an effective program. There are 
examples of school districts in which fall-to-spring gains of 16 NCEs or more 
are reported each year, yet no growth from fall-to-fall or spring-to-spring is 
evident. 

Program evaluations that use fall-to-fall or spring-to-spring NCE or 
percentile gains as a criterion for program effects are better than fall-to- 
spring studies, because the expected gains for annual testing are much 
smaller, in the range of 1-3 NCEs (Gabriel et al., 1985). Yet these have 
serious drawbacks as well. Some evaluations report annual gains for 
successive cohorts of students at a particular grade level (e.g., third graders 
in 1979-80 versus third graders in 1980-81). Cohort differences can be 
affected by many extraneous factors, such as changes in district testing or 
promotion procedures. However, this design is preferable to a comparison 
of this year's gain to last year's for the same students (e.g., comparing the 
gains made by this year's fourth graders to those made by last 
year's third graders). Such designs confound programs' effects with 
retention/promotion, selection into or out of Chapter I or other special 
programs, and other factors. Comparing last year's gains to this year's for 
promoted students only is a much better procedure, but is rarely used; 
including nonpromoted students inflates estimates of annual gain because 
these students are being compared again to the same grade-level norms after 
a year of instruction and growth. 

The inclusion criteria applied in the present review follow from the 
above discussion. The purpose of these criteria is to extract from a large, 
diverse, and messy literature the most convincing evidence available at 
present on effective classroom programs for students at risk. The criteria are 
as follows: 

1. Convincing evidence of effectiveness had to be presented. Unfortunately, 
the nature of existing program evaluation data and of the sources of 
reports on such data precludes identification of programs that are not 
effective. The JDRP and exemplary Chapter I programs come to our 
attention only because they were felt to be outstanding. A pattern of 
fall-to-spring gains and spring-to-fall losses cannot be used as evidence of 
effectiveness, but neither is it convincing evidence of ineffectiveness. In a 
usual review of the literature on experimental programs, effective and 
ineffective programs would be compared in an attempt to identify consistent 
characteristics of the more effective ones (see Cooper, 1984; Light and 
Pillemer, 1984; Slavin, 1986). However, in the present case, the review 
focuses on those programs with convincing evidence of effectiveness, as 
distinct from others that may or may not be equally effective but do not 
present convincing data. A discussion of programs considered for inclusion 
but not included and reasons for exclusion is presented later in this chapter. 

The remaining methodological inclusion criteria define what is meant 
by "convincing evidence of effectiveness." 

2. Evaluations had to use control group designs with random assignment to 
groups and/or convincing evidence that comparison groups were initially 
equivalent in achievement. Studies that showed spring-to-spring gains of at 
least 7 NCEs were also included, but in a separate category. This is about 
one-third of a population standard deviation (approximately 21), but more 
like half of the standard deviation typical of Chapter I students, who have a 
more restricted range of scores. One important category of programs 
excluded from the set of effective models is those that were involved in the 
Follow Through Planned Variation evaluation (Kennedy, 1978) and were 
found to be no more effective than control groups overall, even though they 
were found to be effective in one or more sites. In many cases, data from the 
successful sites passed JDRP review, as the JDRP procedures do not require 
that data from all sites be submitted or considered. 

3. Evaluations had to use standardized, broadly based measures of reading 
and/or mathematics achievement. This excluded a small number of studies 
that used experimenter-made measures or only assessed a narrow set of 
reading or mathematics skills. 

4. The duration of evaluations had to be at least one semester (sixteen 
weeks). Almost all JDRP and Chapter I evaluations cover at least a one-year 
period, but many evaluations of promising programs that have appeared in 
academic journals have involved much shorter treatment durations. To be 
considered a practical evaluation of a replicable program, a duration of one 
semester is a minimal requirement. 
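
The arithmetic behind the 7-NCE threshold in criterion 2 can be sketched as follows. The population standard deviation of 21 comes from the text; the value of 14 for Chapter I students is not reported in the source and is only an assumption inferred from the statement that 7 NCEs is "more like half" of their standard deviation.

```python
POPULATION_SD = 21.0  # approximate NCE standard deviation given in the text
CHAPTER_I_SD = 14.0   # hypothetical value implied by "half of the standard deviation"


def gain_in_sd_units(gain_nce, sd):
    """Express an NCE gain as a fraction of a standard deviation."""
    return gain_nce / sd


print(round(gain_in_sd_units(7, POPULATION_SD), 2))  # about one-third
print(round(gain_in_sd_units(7, CHAPTER_I_SD), 2))   # about one-half
```

Expressed this way, the spring-to-spring threshold can be compared roughly with the effect sizes reported for control group studies.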



CATEGORIES OF EFFECTIVE MODELS 

The principal results of the search for instructional methods with convincing 
evidence of positive effects are summarized in Tables 2-1 through 2-3. In 
each table, those programs evaluated in studies that employed matched or 
(rarely) randomized control groups are listed first, with associated effect 
sizes. These effect sizes should be interpreted very cautiously and should 
not be compared across methods, but are included to characterize the 
relative magnitudes of effects on different outcomes or for different sub- 
populations within programs. 

The effective models are discussed according to categories of programs. 
Surprisingly, among the dozens of programs evaluated for possible 
inclusion, programs that presented convincing data fell into only three 
categories: continuous progress, individualized instruction, and cooperative 
learning. 



Continuous Progress 

The largest number of programs with convincing evidence of effectiveness 
are those falling in the category labeled continuous progress. The individual 
programs in this category vary in many particulars, but features common to 
all include the following: 

1. Students proceed through a well-specified hierarchy of skills. They are 
tested at each level to determine their readiness to move on to the next 
skill. Special procedures are established to assist students who fail to 
pass regular mastery assessments. These may include corrective in- 
struction in small groups, tutoring, assignment to different groups, or 
special materials or activities. In these respects, continuous progress 
programs resemble mastery learning and they are often described as 
such. However, continuous progress programs are fundamentally different 
from the group-based mastery learning programs more commonly 
seen in elementary and secondary schools (see Slavin, 1987a). (Group-based 
mastery learning is discussed later in this chapter.) 

2. Careful records are kept of each student's progress through the curriculum. 
These data are used to make grouping, remediation, and other 
decisions. 

3. Most instruction is delivered by teachers to groups of students at the 
same instructional level. This is primarily what differentiates continuous 
progress from individualized models. In continuous progress models, 
students progress at their own rates, but are primarily instructed by 
teachers rather than by programmed or other individualized materials. 
In these methods, students are constantly grouped and regrouped 
according to their levels of reading and/or math skills; this grouping is 
often done across grade lines, as in nongraded programs (see Goodlad 
and Anderson, 1963). 







Individualized Instruction 

Although there are many individualized programs that have been approved 
by the JDRP, only a few qualified as effective by the criteria applied 
in this chapter. The characteristics common to these programs are as 
follows: 



1. Students work primarily on programmed or other individualized materi- 
als, with teachers working mostly with individuals rather than groups of 
students. This is the primary difference between continuous progress 
and individualized models; many continuous progress models describe 
themselves as "individualized," but do not rely on self-instructional 
materials. 

2. As in continuous progress programs, individualized models keep careful 
records of student progress through a structured, hierarchical set of 
learning objectives. 



Cooperative Learning 

Although there are many methods based on principles of cooperative 
learning (Slavin, 1983a, b; 1988), only two have been evaluated over periods 
of at least one semester in terms of effects on standardized tests of reading 
and math: Team Assisted Individualization (TAI) (Slavin, 1985) and Cooper- 
ative Integrated Reading and Composition (CIRC) (Stevens, Madden, 
Slavin, and Farnish, 1987). In addition to structuring cooperation among 
students, both methods also incorporate elements of continuous progress. 
The elements in common to TAI and CIRC are as follows: 

1. Students work in four- to five-member mixed ability learning teams and 
receive certificates or other recognition based on the learning of all 
group members. Students both help one another learn and assess one 
another's skills in preparing for tests or teacher assessments, which will 
be taken without teammate help. 

2. Teachers instruct students (drawn from the different teams) who are at 
the same level in a hierarchy of skills. Most information comes from the 
teachers, not from peers or materials; teammates help one another 
master skills, but they do not provide initial instruction to each 
other. 

3. Students are frequently assessed, first by teammates and then by the 
teacher. Specific corrective procedures are provided for students who 
do not meet a preset level of mastery. 

CHARACTERISTICS AND OUTCOMES OF 
EFFECTIVE MODELS 

Continuous Progress 

Table 2-1 lists a total of eleven programs that were categorized as 
continuous-progress models with convincing evidence of effectiveness. 
Seven of these used control group designs. 

Distar. Distar (Becker and Carnine, 1980) is a program developed at 
the University of Oregon. Distar is unusual, even within the range of 
continuous-progress programs. It provides teachers with very specific 
scripts to use in teaching reading and math and trains teachers in very 
specific methods, down to the level of how to use hand signals to elicit 
student responses and how to call on students. In describing the program, 
Distar's developers tend to emphasize the sequential, hierarchical curriculum 
design, direct instruction and rapid pace, and high frequency of student 
responses (Becker and Carnine, 1980). However, what defines Distar as a 
continuous progress model is that students are taught in small groups that 
are homogeneous in skill level, are assessed frequently on their progress 
through a well-defined skill hierarchy, and are regrouped if necessary 
according to the results of these assessments. 

The evidence supporting the effectiveness of Distar for increasing 
student performance in certain skill areas is strong. Distar was the only one 
of the nine major programs evaluated in the national Planned Variation 
Follow Through studies conducted by Abt Associates (Stebbins, St. Pierre, 
Proper, Anderson, and Cerva, 1977) to have consistently positive effects on 
the achievement of disadvantaged students. The effect size estimates that 
appear in Table 2-1 are from the Abt evaluation, pooling across cohorts (see 
Kennedy, 1978). As the table indicates, however, the effects of Distar 
appeared primarily on language and math computation tests, not on such 
higher-order skills as reading comprehension or math problem solving. Even 
on these scales, though, Distar students scored higher than all other Follow 
Through models. Note that these effects were brought about over three or 
four years of implementation, as the Abt evaluation focused on students who 
were in their respective programs from kindergarten or first to third grade. 

What makes the Abt evaluation results particularly impressive is that 
this evaluation involved many sites, matched control groups, and consistent 
measurement across non-Follow Through and various Follow Through 
treatments. In reviewing instructional programs for this chapter, there was 
always a nagging question of whether reports of successful evaluations sent 
to the JDRP or published in a journal are anomalies, perhaps one or two sites 
out of dozens in which the program was evaluated. In fact, in several cases, 
sites that had positive results successfully passed JDRP review even though 

TABLE 2-1. Continuous-Progress Programs 

I. Programs Evaluated Using Control Group Designs 

DISTAR (Becker & Carnine, 1980; Stebbins et al., 1977; JDRP #77-122, 
JDRP #80-50) 
    Grades/Subjects: K-6; Reading, Math 
    Description: Teacher instructs small groups using highly structured, 
        scripted lessons. Students frequently assessed, regrouped. 
    Evaluation: Abt evaluation found positive effects in reading and math in 
        most sites; JDRP-approved sites include Flint, MI; Dayton, OH; 
        Flippin, AR; E. Las Vegas, NM; Uvalde, TX; Kingstree, SC; 
        Cherokee, NC; Washington, DC. 
    Effect Sizes: Gr. K-3 (4 yrs.): Rdg. Comp. +.07; Language +.84; 
        Math Comp. +.57; Math P.S. +.17 

U-SAIL (JDRP #76-95) 
    Grades/Subjects: 1-9; Reading, Math 
    Description: Combines continuous progress, individualized activities. 
        Adapts to existing curriculum, materials. 
    Evaluation: Both time series and control group comparisons show clear 
        effects in reading and math in suburban school near Salt Lake City. 
        Not disadvantaged. 
    Effect Sizes: Gr. 3-5 (2 yrs.): Reading +.45; Math +.27 

PEGASUS-PACE (JDRP # I, JDRP #79-1) 
    Grades/Subjects: K-8; Reading 
    Description: Students proceed through 17 reading levels, gr. K-8; 
        frequently assessed, regrouped. 
    Evaluation: Matched control group design found convincing effects in 
        reading in rural Princeton, IL; some students were Title I, but 
        mostly not disadvantaged. 
    Effect Sizes: Gr. 2 +.72; Gr. 3 +.80; Gr. 4 +.50; Gr. 5 +.39; 
        Gr. 6 +.43. Rdg. Voc. +.53; Rdg. Comp. +.61 

ECRI (JDRP #74-48) 
    Grades/Subjects: 1-6; Reading 
    Description: Teacher instructs small groups; frequent mastery checks. 
    Evaluation: Most evaluations fall-spring or spring-spring. One control 
        group evaluation in suburban Chapter I school near Cincinnati found 
        convincing effects. 
    Effect Sizes: Gr. 4: Rdg. Voc. +.51; Rdg. Comp. [illegible] 

Project INSTRUCT (JDRP #75-37) 
    Grades/Subjects: K-3; Reading 
    Description: Cross-grade grouping according to reading skills; students 
        proceed through skills at own rate. 
    Evaluation: Compared students in Lincoln, NE, schools that had 
        successfully implemented program to matched schools. Not 
        disadvantaged. 
    Effect Sizes: Gr. 2: Word Knwl. +.10; Reading +.21 

GEMS (JDRP #79-2) 
    Grades/Subjects: K-12; Reading 
    Description: Students work in small groups or individually on materials 
        at their own level. Frequent assessment, mastery tests, corrective 
        instruction. 
    Evaluation: Compared project schools with matched controls in suburb of 
        Salt Lake City. Not disadvantaged. 
    Effect Sizes: Gr. 1 +.11; Gr. 2 +.00; Gr. 3 +.41; Gr. 4 +.23; 
        Gr. 5 +.44; Gr. 6 +.39. Rdg. Voc. +.39; Rdg. Comp. +.14 

Early Childhood Preventative Curriculum (JDRP #74-57) 
    Grades/Subjects: 1 only; Reading 
    Description: Students identified as high risk given intensive continuous 
        progress, diagnostic-prescriptive program with small group and 
        individualized activities. 
    Evaluation: Compared project schools to matched controls in Miami, FL. 
    Effect Sizes: Gr. 1: Par. Mng. +.95; WRAT +.28 

II. Programs Evaluated Using Yea to- Year Gains 



WILASD 1-3 
(JDRP #83-2) Reading 



COMP 1-8 
(JDRP #74-114) Math 



CLIMB 1-12 
(JDRP #81-44) Reading 
Math 



Outcomes-Driven 1-8 
Developmental Reading 
Model (Mamary Math 
and ttowc, 1985; 
JDRP #85-7) 
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Combines continuous progress, 
whole class, and 
individualized instruction. 



Students taught in small groups, 
progress through 25 levels in 
8 yrs. 

Diagnostic»prcscriptivc, 
continuous progress. Title I 
teachers mostly consult, do 
some pullout. 

Students grouped across grades 
according to skills. Frequent 
mastery assessments, 
corrective instruction. Uses 
cooperative learning during 
independent practice. 

V) 



Compared cohorts in mostly 
Hispanic, low-income school 
in Wcslaco, TX, before 
program begun and then 
scv«ral years after. Clear 
differences in scores. 

FiilkO'fall trends in NCKs show 
clear gains in math. 
Columbia, MO. Not 
disadvantaged. 

Grades 4-6 Title I students in 
suburban Middlesex, NJ, 
gained significantly in 
fall-to-fall reading and mailt 
NCEs. 

Gains over several years in 
percent of students scoring 
above grade level, Johnson 
City. NY. Mostly not 
disadvantaged. 
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they were using methods that were unsuccessful at most other sites. The 
effects for Distar summarized in Taole 2-1 are averaged across all sites that 
used the program, successfully or not. 

Recent research on Distar has found that the program's effects can be 
long-lasting. In a follow-up of students from an inner-city New York 
elementary school, Distar students exceeded control students in high school 
graduation; about 55 percent of the former Distar students graduated, 
compared to 34 percent of control students (Meyer, 1984). 

Despite the well-documented and widely acknowledged positive ef- 
fects of Distar, there has also been much criticism of this approach, 
principally on the basis of its use of scripted lessons and a perceived focus 
on rote skills rather than higher-order, learning-to-learn skills (e.g., see 
Calfee, 1986). Many teachers and administrators resist Distar for similar 
reasons. Given this reality, it is important to note that Distar is only one of 
several successful continuous progress models, is the only one to use 
scripted lessons, and is one of only two (with ECRI) to emphasize the highly 
organized, teacher-centered classroom organization that many teachers find 
offensive. 

U-SAIL. The Utah System Approach to Individualized Learning 
(U-SAIL) was developed and evaluated in Utah. It is a continuous progress 
program quite different from Distar. In it, students proceed through a 
hierarchical sequence of objectives at their own rates. Most instruction is 
given to small groups, but some individual work is included. While the 
teacher is instructing skill-level groups, other students are given 
independent, exploratory activities, with a particular emphasis on 
independent reading (see Hales, 1983a, b). 

The JDRP submission for U-SAIL presents data from only one 
elementary school in suburban Davis County, Utah. Evidence is provided to 
show that the school was similar to the control school for three years before 
U-SAIL was begun, but gained more than the control school afterwards. 
The data in Table 2-1 represent the degree to which U-SAIL students 
outperformed control students in reading and math from grades 3-5. 
Unfortunately, we have no idea how successful the program has been in 
other schools, or whether the active involvement of the developer is needed 
to make the program successful. Also, no evidence is given that U-SAIL can 
be effective with disadvantaged or low-achieving students. 

PEGASUS. This program was apparently the first to pass JDRP 
review, as it was given the first JDRP number. PEGASUS is a classic 
continuous-progress program. It organizes the reading program into seven- 
teen levels, spanning grades K-8. Students progress through these levels at 
their own rates, but are taught in groups appropriate to their current levels. 
Within each of the levels is a continuum of skills that students must master. 

PEGASUS was originally developed in Tuscaloosa, Alabama, but data 
could be obtained for evaluation only from a "turnkey" site in rural Illinois. 
In this evaluation, schools in Bureau County were matched with similar 
schools in La Salle County for a one-year study. Results indicated signifi- 
cantly greater growth in reading for PEGASUS students at most grade levels 
on vocabulary as well as reading comprehension measures. However, 
although some of the students were poor, results for these students were not 
separately reported, so generalization to disadvantaged groups may not be 
warranted. 

ECRI. The Exemplary Center for Reading Instruction (ECRI), like 
U-SAIL, was originally developed in Utah. However, it is quite unlike U-SAIL 
in procedures, more resembling Distar in classroom organization, rapid 
instructional pace, detailed and specific instructions for teachers, and 
frequent assessment of student progress. In ECRI, students are usually 
assigned to three reading groups. They receive instruction in their groups, 
and then work on materials at their own rates. ECRI emphasizes teaching 
specific word attack skills and proceeding through large numbers of words in 
a short time. 

Most evaluations of ECRI presented as part of the JDRP submission 
used fall-to-spring designs. One study, in a relatively low-SES school in a 
suburb of Cincinnati, used a control group design, and the results of this 
study were used to compute the effect sizes shown in Table 2-1. Also, 
spring-to-spring data from a mostly low-SES, all-black school in rural 
Louisiana and from a Chapter 1 school in suburban Englewood, Ohio, 
suggest that ECRI can be effective with disadvantaged and low-achieving 
students. 

Project INSTRUCT. Like PEGASUS, Project INSTRUCT is a 
classic continuous-progress program. It was developed and evaluated in 
Lincoln, Nebraska. In the program, students are grouped according to skill 
levels, and may proceed through a hierarchy of skills at their own rates. An 
evaluation of the program by Novak (1975) compared schools that 
successfully implemented Project INSTRUCT to matched control schools, and 
found small positive effects in reading and math. 

GEMS. Goal-Based Educational Management System (GEMS) is a 
diagnostic-prescriptive reading program developed in Utah. In this program, 
students proceed at their own rates through 200 skill levels covering grades 
K-12. Each unit has a pretest and a posttest. According to placement tests 
or pretests, students are placed in appropriate instructional groups. 
Teachers use a variety of teaching strategies, following which students must 
attain a score of at least 80 percent to exit the unit. Those who do not 
achieve this score are given alternate materials and as much time as they 
need to ultimately pass. A computer management system helps teachers keep track 
of student progress. 
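
The unit-exit rule just described can be sketched in a few lines of Python. This is only an illustration of the logic, not a description of the actual GEMS computer management system: the function name `next_step`, the unit numbering, and the representation of scores as fractions are assumptions made here. Only the 80 percent criterion comes from the text.

```python
# Hypothetical sketch of a mastery exit rule; names are illustrative only.
MASTERY_CRITERION = 0.80  # from the text: at least 80 percent to exit a unit


def next_step(unit: int, posttest_score: float,
              criterion: float = MASTERY_CRITERION) -> str:
    """Return the assumed next action for one student after a unit posttest.

    posttest_score is the fraction of items correct (0.0 to 1.0).
    """
    if posttest_score >= criterion:
        # Criterion met: the student exits the unit and moves on.
        return f"advance to unit {unit + 1}"
    # Criterion not met: alternate materials and more time, then a retest.
    return f"alternate materials, then retest unit {unit}"


print(next_step(12, 0.85))  # advance to unit 13
print(next_step(12, 0.70))  # alternate materials, then retest unit 12
```

The point of the sketch is that time, rather than the standard, is what varies: a student below the criterion stays in the same unit until the posttest is passed.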

An evaluation in rural Jordan, Utah, found that GEMS students 
generally exceeded control in vocabulary, although differences in reading 
comprehension were small and were not found in grades 1 and 2. No 
evidence is given to suggest that the program might be effective with 
disadvantaged students. 

Early Childhood Preventative Curriculum. This program is quite 
different from the others in Table 2-1 in that it is intended to be used 
intensively with high-risk first graders. These students are put into a special 
class during first grade and experience an individualized diagnostic- 
prescriptive program, in which students' individual strengths and 
weaknesses are identified and students are allowed to proceed at their own rates. 
Most instruction is given in small, skill-level groups. 

ECPC was developed and evaluated in Miami, Florida. The evaluation 
compared the program to matched as well as random controls. Overall, 
positive effects were found on the Paragraph Meaning scale of the Stanford 
Achievement Test and (to a lesser extent) on the WRAT. 

Continuous-Progress Programs Evaluated Using Year-to-Year 
Gains. In addition to the programs listed above, four additional 
continuous-progress models presented convincing evidence for year-to-year 
gains: Weslaco Individualized Reading/Language Arts Instruction and Staff 
Development Process (WILASD), Conceptually Oriented Mathematics 
Curriculum (COMP), Coordinated Learning Integration-Middlesex Basics 
(CLIMB), and Outcomes-Driven Developmental Model (ODDM). The 
ODDM model, developed and evaluated in Johnson City, New York, has 
received much attention in the educational literature (e.g., Mamary and 
Rowe, 1985). The procedures used in these models generally fall within the 
range represented among the other continuous-progress programs. 

A few additional continuous-progress programs passed by the JDRP 
might also be effective, but did not present adequate evidence. For example, 
Intensive Reading Instructional Teams (IRIT; JDRP No. 74-11) presented 
only fall-to-spring percentile gains. The Cranston Comprehensive Reading 
Program (JDRP No. 82-28) used a regression-discontinuity evaluation 
design that showed an increase in student scores the year the program was 
begun, but the district also changed test forms at the same time, making the 
gains difficult to ascribe to the program with any confidence. 

Individualized Instruction 

Table 2-2 lists individualized instruction programs that presented convinc- 
ing evidence of effectiveness. Only three programs fall into this category and 
of these, only one used a control group design. 

TABLE 2-2. Individualized Instruction Programs 

I. Programs Evaluated Using Control Group Designs 

Matteson Four-Dimensional Reading Program (JDRP 77-109) 
    Grades/Subjects: 3-8; Reading 
    Description: Students work on individualized learning packets 40-60% 
        of reading time. 
    Evaluation: Compared project classes to matched control classes in 
        suburban Chicago. Not disadvantaged. 
    Effect Sizes: Gr. 3-6 (3 yrs.): Reading +.57 

II. Programs Evaluated Using Year-to-Year Gains 

AIRS (JDRP 74-25) 
    Grades/Subjects: 1-6; Reading 
    Description: Students mostly work on individualized materials, some 
        small-group work. 
    Evaluation: Comparison of spring testing shows cohort gains in 
        suburban Andover, MA. Not disadvantaged. 

STAMM (JDRP 76-87) 
    Grades/Subjects: K-8; Math 
    Description: Structured, individualized program with programmed 
        materials. 
    Evaluation: Evaluations in suburban Denver and suburban Atlanta show 
        gains in spring NCEs. Not disadvantaged. 

Matteson Four-Dimensional Reading Program. This program, 
developed and evaluated in a Chicago suburb, provides students with 
individualized learning packets through which students proceed at their own rates. 
Some small- and large-group activities are used, and the individualized work 
occupies only 40-60 percent of the total reading period, so the Matteson 
model cannot be called a "pure" individualized program. An evaluation 
comparing matched schools found that fourth graders in the Matteson 
program gained more than control students in Stanford Achievement Test 
scores over a three-year period. However, no evidence is given that would 
suggest that this program could be successfully applied to at-risk students. 

AIRS. Andover's Individualized Reading System (AIRS) is another 
self-paced instructional model developed and evaluated in a well-to-do 
school district, Andover, Massachusetts. AIRS is a classic, comprehensive 
individualized approach that uses programmed self-instructional materials. 
Almost all language arts instruction is delivered through the materials with 
teachers, aides, and parent volunteers giving one-to-one assistance as 
needed. Comparisons of Stanford Achievement Test scores in student 
cohorts before and after the AIRS program was introduced show clear gains 
that were maintained over several years. An AIRS evaluation in rural Rhode 
Island also found achievement gains each year after the program was 
installed. Again, no evidence is given to suggest that AIRS would be 
effective with disadvantaged or low-achieving students. 

STAMM. Systematic Teaching and Measuring Mathematics 
(STAMM) was developed and initially evaluated in Jefferson County, 
Colorado, a Denver suburb. Like AIRS, STAMM is a fully individualized 
instructional program, relying primarily on self-instructional materials and 
one-to-one instruction by teachers and aides. Evaluations in Jefferson 
County and in an Atlanta suburb showed that students who experienced 
STAMM scored higher on the CTBS than did students in earlier cohorts. 
Special forms of STAMM exist for use with Chapter 1 and learning-disabled 
students, but evaluations with these or other at-risk students are not known 
to exist. 

Other Individualized Programs. What is noteworthy about Table 2-2 
is not so much the programs listed there as the programs not listed. A large 
number of JDRP-approved programs used individualized models, and the 
broader educational literature has many studies of such methods. Yet very 
few of these present convincing evidence of effectiveness. 

One individualized program that was carefully considered for inclusion 
in Table 2-2 is the University of Kansas Behavior Analysis Program (Ramp 
and Rhine, 1981). This program is generally acknowledged to have been the 
second most effective of the Planned Variation Follow Through models, 
after Distar. Also, two Behavior Analysis sites, Trenton, New Jersey (JDRP 
No. 77-139) and Waukegan, Illinois (JDRP No. 77-126), passed JDRP 
review with convincing control-group evaluations (conducted as part of the 
overall Abt evaluation). However, site variability for Behavior Analysis was 
extreme; in some locations, especially New York City, Behavior Analysis 
students scored significantly worse than non-Follow Through students. 
Overall, Kennedy (1978) reports effect sizes of -.12 for reading 
comprehension and +.02 for language. Only in math computations is there positive 
evidence (ES = +.28), but math problem-solving scores slightly favored the 
control group (ES = -.06). 

Another individualized model, the Responsive Early Childhood Edu- 
cation Program (RECEP; JDRP No. 77-154) also presents a paradox in that 
data submitted to the JDRP show some positive effects but the overall 
combined results from all Planned Variation sites did not. Several other 
individualized programs compared experimental to "matched" control 
groups without establishing that the control groups were initially equivalent 
(e.g., Randolph County Follow Through, JDRP No. 77-149), or presented 
only minimal spring-to-spring gains (e.g., Personalized Classroom 
Management System, JDRP No. 78-170). 

The JDRP programs that were excluded from Table 2-2 rarely provide 
data that could indicate that the programs are not effective, as such 
programs would be unlikely to pass JDRP review. However, in the broader 
education literature there have been several good-quality evaluations of 
individualized programs that have indicated few positive effects (see Horak, 
1981; Miller, 1976; Thompson, 1975). When positive effects of individualized 
programs have been found, they are often in studies with poor 
methodological characteristics. For example, Wang and Walberg (1983) claimed 
positive effects of the University of Pittsburgh's Adaptive Learning 
Environments Model (ALEM) by presenting comparisons of "expected" scores 
based on national norms to scores of students in the ALEM program. 
However, the one study of this method to use a control group found no 
achievement differences (Wang and Birch, 1984). 

Cooperative Learning 

Table 2-3 lists two cooperative learning programs, TAI and CIRC, both of 
which were developed and evaluated at the Johns Hopkins University 
Center for Research on Elementary and Middle Schools. All four of the 
studies that evaluated these models (plus four additional studies that had 
durations of less than a semester) used control group designs. 

TABLE 2-3. Cooperative Learning Programs 

I. Programs Evaluated Using Control Group Designs 

Team Accelerated Instruction (Slavin et al., 1984; Slavin & Karweit, 1985; 
JDRP #84-5) 
    Grades/Subjects: 3-6; Math 
    Description: Students work on programmed materials in mixed-ability 
        teams while teachers teach same-ability teaching groups. 
    Evaluation: In three studies, two with random assignment, one 
        matched, TAI classes exceeded control. One study in rural MD, 
        one in suburban MD, one in urban DE. 
    Effect Sizes: 
        Full Sample (Gr. 3-6): Math Comp. +.50; Math C&A +.06 
        Mainstreamed Students: Math Comp. +.38; Math C&A +.47 

Cooperative Integrated Reading & Composition (Stevens et al., 1987) 
    Grades/Subjects: 3-5; Reading, Writing 
    Description: Students work in mixed-ability teams while teacher 
        teaches reading groups. 
    Evaluation: CIRC classes exceeded matched control classes in study in 
        suburbs of Baltimore. 
    Effect Sizes: 
        Full Sample (Gr. 3-4): Rdg. Voc. +.12; Rdg. Comp. +.35; 
            Oral Rdg. +.54; Language +.30 
        Remedial Students: Rdg. Voc. +.26; Rdg. Comp. +.40; 
            Oral Rdg. +.71; Language +.37 
        Mainstreamed Students: Rdg. Voc. +.90; Rdg. Comp. +.99; 
            Language .00 

TAI. Team Accelerated Instruction or TAI (Slavin, 1985) combines 
cooperative learning with a continuous progress approach to mathematics 
instruction. In this model, students are assigned to four- to five-member, 
mixed-ability learning teams, and they are also assigned to skill-level groups 
on the basis of a placement test. Teachers instruct the skill-level groups on 
the concepts of mathematics, after which students return to their teams and 
work on self-instructional materials. Teammates frequently check one 
another's work against answer sheets, assist one another with difficult 
problems, and prepare one another for quizzes. Quizzes are taken apart 
from the team area, and achieving a mastery score on quizzes adds points to 
students' team scores. At the end of each week, teams that exceed a 
preestablished criterion in terms of numbers of units mastered may earn 
certificates or other rewards. Specific procedures and parallel assessments 
are provided for students who do not achieve at a mastery level. 
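
The weekly team recognition step can be illustrated with a short Python sketch. It is not the actual TAI scoring procedure: the function name `teams_earning_rewards`, the use of a per-member average (assumed here so that four- and five-member teams are compared fairly), and the criterion value are all hypothetical. The text specifies only that teams exceeding a preestablished criterion in units mastered may earn rewards.

```python
# Hypothetical sketch of weekly team recognition; not the published TAI rules.
from typing import Dict, List


def teams_earning_rewards(units_mastered: Dict[str, List[int]],
                          criterion: float) -> List[str]:
    """Return the teams whose average units mastered per member exceeds
    the criterion.

    units_mastered maps a team name to the number of units each member
    mastered this week. Averaging per member is an assumption of this sketch.
    """
    winners = []
    for team, counts in units_mastered.items():
        # Skip empty teams to avoid dividing by zero.
        if counts and sum(counts) / len(counts) > criterion:
            winners.append(team)
    return sorted(winners)


weekly = {
    "Team A": [2, 3, 1, 2],      # average 2.0
    "Team B": [1, 1, 0, 2, 1],   # average 1.0
    "Team C": [3, 2, 2, 3],      # average 2.5
}
print(teams_earning_rewards(weekly, criterion=1.5))  # ['Team A', 'Team C']
```

Because the reward depends on every member's mastery, teammates have a reason to help one another prepare for quizzes, which is the incentive structure the paragraph describes.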

TAI has been evaluated in six field experiments, but only three of these 
met the one-semester duration requirement for inclusion in Table 2-3. Two 
of the three studies (Slavin and Karweit, 1985) are the only studies listed in 
Tables 2-1 through 2-3 that randomly assigned teachers to treatments. One 
of these took place in and around Hagerstown, Maryland, and the other in 
inner-city Wilmington, Delaware. The Hagerstown study compared TAI to a 
control group and to the Missouri Mathematics Program (Good, Grouws, 
and Ebmeier, 1983), a whole-class instructional method that incorporates 
teaching behaviors derived from process-product research. The Wilmington 
study compared TAI to the Missouri Mathematics Program only; and a study 
in a Baltimore suburb (Slavin, Madden, and Leavey, 1984) compared TAI to 
an untreated control group. 

Results of all three studies favored the TAI classes, with the largest 
effects in the Wilmington study. Separate analyses for mainstreamed 
academically handicapped students in the suburban study also found posi- 
tive effects for these students. 

CIRC. Cooperative Integrated Reading and Composition (Madden, 
Slavin, and Stevens, 1986; Stevens, Madden, Slavin, and Farnish, 1987) 
uses a combination of mixed-ability, cooperative work groups and skill- 
based reading groups to teach reading, language arts, and writing in the 
upper elementary grades. In CIRC, students are assigned to pairs within 
their reading groups, and then the pairs are assigned to four- to five-member 
teams, so that the teams are heterogeneous. During reading periods, 
teachers work with reading groups over the course of each week to set a 
purpose for reading, introduce vocabulary, discuss the characters, setting, 
problems, and problem solutions in narrative stories, and so on. Students 
back in their team areas work in pairs on a prescribed series of activities, 
including reading basal stories to one another, identifying elements of story 
structure, predicting story outcomes, practicing vocabulary, word lists, and 
spelling, and writing in response to stories. Each week teachers provide 
instruction in such reading comprehension skills as finding the main idea. 
Students earn points for their teams based on the sum of their individual 
performances on regular quizzes. The CIRC language arts program is based 
on a writing process model, with students working together in their teams to 
plan, draft, revise, edit, and ultimately publish compositions. Language 
mechanics instruction is provided in the context of students' writing. 

The semester-long study that evaluated CIRC found positive effects in 
comparison to matched control classes on CTBS reading comprehension, 
reading vocabulary, language expression, and language mechanics scales. 
Separate analyses for mainstreamed academically handicapped students 
found strong effects on both reading scales but not on the language 
measures; for remedial reading students (mostly Chapter 1), effects were 
similar to those in the full sample. 

Other Research on Cooperative Learning. Although the four studies 
cited above are the only ones to use standardized reading and math 
measures at the elementary level in studies of at least a semester, several 
additional studies support the achievement effects of cooperative learning. 
Two shorter studies of TAI found positive effects on math computations 
measures (Slavin, Leavey, and Madden, 1984), though a third found no 
differences (Oishi, Slavin, and Madden, 1983). A twelve-week study of 
CIRC (Madden, Stevens, and Slavin, 1986) found positive effects on CTBS 
reading comprehension, reading vocabulary, language expression, and 
spelling scales. Other cooperative learning methods, such as Student Teams- 
Achievement Divisions (STAD) and Teams-Games-Tournaments (TGT), 
have had positive effects in mathematics, language arts, and other subjects 
at the elementary and secondary levels (Slavin, 1983a, b; 1988). In addition, 
all of the cooperative learning methods have had positive effects on such 
outcomes as race relations, acceptance of mainstreamed students, and 
self-esteem (Slavin, 1983a). 



Other Classroom Models 

Considering that there are more than 400 programs passed by the JDRP plus 
many others evaluated and reported in the educational literature, how is it 
that only sixteen programs in three categories are emphasized here? 

Essentially, this chapter took a conservative approach in identifying 
effective programs. Only programs with convincing evidence of 
effectiveness were listed, and even those are not all certain, as there is always the 
possibility that data submitted to JDRP or published in journals is taken from 
one or more selected sites and is not representative of all program sites. Yet 
it seems certain that there are more than sixteen effective classroom 
programs. 

This section discusses the state of the evidence on well-known or 
widely used models not represented among those listed in Tables 2-1 
through 2-3. 

Developmental/Humanistic Models. One major category of 
programs not listed in this review is models that might be categorized as 
"developmental/humanistic." This includes the open classroom, 
Piagetian-based models, and other types of programs in which students are given 
choices of activities appropriate to their developmental levels, and are 
encouraged to discover and experience language and mathematical concepts 
rather than to master a preestablished series of skills. 

One important example of the developmental/humanistic approach is 
the program developed by the Bank Street College of Education (Gilkeson, 
Smithberg, Bowman, and Rhine, 1981). Several sites using the Bank Street 
Program passed JDRP review (JDRP No. 77-156). However, the overall 
effects of this program in the Planned Variation evaluation were about zero 
in reading and language and negative in math. The High Scope/Cognitively 
Oriented Curriculum (Weikart, Hohmann, and Rhine, 1981), a 
child-centered, Piagetian model that extends into the elementary grades many of 
the elements of the Perry Preschool Program (Weikart, 1974), also has 
several JDRP-approved sites (JDRP No. 77-123), but came out near the 
bottom of the list in the overall Abt evaluations (Kennedy, 1978). What these 
findings may imply is that the developmental/humanistic models can be 
effective, as evidenced by their apparent success in several sites, but the 
evidence for this or evidence of what is required to ensure their success are 
essentially lacking. Also, it should be noted that critics of the Abt evaluation 
(e.g., House, Glass, McLean, and Walker, 1978) have argued that the use of 
standardized achievement tests may have biased the evaluation in favor of 
such highly structured programs as Distar and Behavior Analysis. However, 
the Abt evaluations focused on children who had been in their respective 
models three or four years. If the programs had any important effects on 
students' cognitive performance, a standardized test should have registered 
some (if not all) of those effects. 

Group-Based Mastery Learning. Another widely used approach to 
school improvement is group-based mastery learning (Block and Anderson, 
1975; Guskey and Gates, 1985). The teacher assesses student learning at the 
end of a series of lessons. Those who achieve at a preestablished level of 
mastery (e.g., 80 percent) are given enrichment activities, while others are 
given corrective instruction designed to bring them up to the mastery 
criterion. Claims of substantial positive effects of group-based mastery 
learning on student achievement are based largely on very brief 
experiments. In a review of research on group-based mastery learning, Slavin 
(1987a) found that in studies of at least four weeks' duration there were no 
significant positive effects of mastery learning on standardized measures, 
and modest effects (median ES = +.25) on experimenter-made measures. 
However, it is important to note that the concepts of mastery assessment 
and corrective instruction are central to all three categories of effective 
programs emphasized in this review, and some of the continuous-progress 
programs, such as GEMS and the Outcomes-Driven Developmental model, 
identify themselves as mastery learning models (see Levine, 1985). 

PRINCIPLES OF EFFECTIVE CLASSROOM 
PROGRAMS FOR STUDENTS AT RISK 

The purpose of this chapter is not only to identify individual programs, but 
also to discover principles that underlie effective models for students at risk. 
The effective programs identified in an extensive review of the literature fell 
into three categories: continuous progress, individualized instruction, and 
cooperative learning. However, the three effective individualized 
instruction models were all developed and evaluated in middle-class, 
nondisadvantaged areas in which few students could be categorized as "at risk" for 
learning problems. Also, the broader literature has not tended to support the 
use of individualized, programmed instructional models. AIRS, STAMM, 
and the Matteson 4-D models may have unique features that could make 
them effective with at-risk students, but given the past history of research on 
individualized programs, this remains to be seen. 

It is in the remaining two categories that the greatest confidence can be 
placed. In the case of the continuous progress models, it is important to note 
not only the quality of the individual studies but also the number of 
successful models that began from very different theoretical bases and 
ended up with similar programs. Some of these models describe themselves 
as "individualized" (e.g., U-SAIL), but in order to provide adequate 
instruction to all students, incorporated groupings of students according to 
skill levels. Others (e.g., GEMS, ODDM) describe themselves as "mastery 
learning," but incorporated subgrouping of students to accommodate 
instruction to individual needs. Developers of Distar describe the model as 
"direct instruction," yet the skill-level grouping, frequent assessment, and 
progress through a structured hierarchy of skills have little in common with 
more recent class-paced forms of "direct instruction" such as the Missouri 
Mathematics Program (Good, Grouws, and Ebmeier, 1983). 

Although very different in many respects, the effective cooperative 
learning programs listed in Table 2-3 have many features in common with 
the continuous progress models. First, even though students do most of their 
work in mixed-ability, cooperative learning teams, instruction in TAI and 
CIRC is directed toward small, homogeneous groups of students, so 
instruction is at a level appropriate to student needs. Second, instruction 
comes from the teacher, not from written materials or peers. Third, 
students' progress is constantly assessed and specific procedures exist to 
remediate any small problems before they become large ones. Finally, in 
TAI and in the reading comprehension component of CIRC, there is a 
structured hierarchy of skills that students must master. Cooperative 
learning methods add to these the opportunity for students to explain 
concepts to one another and help one another study, team incentives based 
on the learning of team members, and student responsibility for many 
elements of classroom management. This last feature may be particularly 
important in today's lean times; TAI and CIRC have always been evaluated 
in classrooms in which there were no aides or volunteers, whereas con- 
tinuous-progress and individualized models have almost always had addi- 
tional personnel. 

What emerges from an examination of the effective classroom pro- 
grams is the following characterization: Consistently effective classroom 
programs accommodate instruction to individual needs while maximizing 
direct instruction, and they frequently assess student progress through a 
structured hierarchy of skills. 

The first part of this statement, the need to accommodate instruction to 
individual needs while maximizing direct instruction, is also supported by 
research on ability grouping in elementary schools, recently reviewed by 
Slavin (1987b). This review found that while assigning students to 
self-contained classes on the basis of general ability was ineffective, two forms of 
ability grouping did have considerable support in good-quality experimental 
studies. One was the Joplin Plan, in which students are regrouped for 
reading across grade lines into reading classes all operating at one reading 
level. This plan obviates the need for grouping within the reading class, 
giving teachers the opportunity to spend all of the reading period in direct 
instruction. Studies of the Joplin Plan and closely related forms of non- 
graded plans found consistent positive effects on standardized reading 
measures (median ES = +.44). 

The second form of ability grouping found in the Slavin (1987b) review 
to be instructionally effective was within-class ability grouping in math 
(median ES = +.34). Use of math groups, especially if the number of groups 
is kept to two or three, strikes a balance between accommodating instruction 
to individual needs and providing adequate direct instruction. Since there is 
a need for independent seatwork in math, grouping within the class to meet 
students' diverse needs and having the teacher present two or three lessons 
at different levels may be the most efficient use of instructional time. 

Slavin (1987c) has argued that to make a meaningful difference in 
student achievement, four elements of classroom organization must be 
simultaneously addressed: quality of instruction, appropriate level of 
instruction, incentive, and time. The continuous-progress programs and cooperative 
learning methods address most of these elements, in particular in providing 
appropriate levels of instruction (by grouping students according to skill 
level) while maintaining an adequate quality of instruction (by having the 
teacher be responsible for direct instruction and by having a well thought out 
sequence of instructional objectives). Cooperative learning adds to the 
incentive element the use of team rewards based on team members' 
learning, and both types of programs emphasize effective use of time 
through careful structuring of classroom activities. 

The importance of accommodating student needs while maintaining 
adequate direct instruction is perhaps greatest for at-risk students. Students 
who begin to experience difficulties in their early years in school are unlikely 
to be able to keep up with the class pace or grade-level expectations. In 
response to this, schools provide remedial and special education for students 
who are falling behind. Yet these programs rarely accelerate students 
enough to enable them to catch up with their classmates (Kennedy, Birman, 
and Demaline, 1986; Madden and Slavin, 1983; Slavin, 1987d; Carter, 1984). 
Building classroom models able to meet diverse needs would seem to be a 
better approach to the problems of at-risk students than creating a parallel 
instructional system, which creates a large set of new problems in terms of 
coordination with the regular program, transitions, and so on (Johnston, 
Allington, and Afflerbach, 1985). 



CONCLUSIONS 

This chapter reviewed all sources of evidence concerning classroom pro- 
grams that have been or could be applied to accelerating the reading and 
math achievement of at-risk students. Among programs whose evaluations 
used control group designs with good evidence of initial experimental- 
control equivalence, study durations of at least a semester, and standardized 
measures of reading and math, two categories of programs emerged as 
particularly effective: continuous progress and cooperative learning. These 
approaches have in common the use of instruction to small, skill-based 
groups; instruction from the teacher rather than from materials or peers; and 
frequent assessment of student progress through a hierarchical curriculum. 

However, while this review is based on the best evidence currently 
available concerning the achievement effects of practical classroom instruc- 
tional models, it is important to reiterate that this evidence is far from 
definitive. Even the carefully selected programs emphasized in this review 
could prove to be ineffective in later research: as noted previously, it is 
always possible that data submitted to the JDRP or to journals is from the 
best sites, not from a representative sample of implementations. It is also 
likely that many programs excluded from this review will prove to be 
effective or could be modified to be effective. Further, while it is of some use 
to know that Program X is effective in comparison to traditional methods, 
there is much more we must understand to make this information useful in 
informing us about program design and classroom instruction in general. 
What elements of Program X account for its effects? For whom is it 
effective? Does it have important side effects? 

We are still at a primitive state in understanding effective instruction 
for students at risk. What is needed is a renewed focus on development, 
evaluation, and component analyses of programs based on many models of 
teaching and learning. 

One approach to identifying effective, transportable instructional mod- 
els for students at risk would be learning from the mistakes of the Planned 
Variation Follow Through evaluation but trying again to accomplish its 
objectives. A very small proportion of the enormous Chapter 1 budget, for 
example, could fund development of promising models, evaluation and 
component analyses of such models, and ultimately evaluation of the models 
in new sites with multiple measures (not only standardized tests), with 
random assignment of programs to schools (see Slavin, 1987d). In this way, 
we could finally achieve the ambitious goals of Chapter 1 to make a 
meaningful difference in the lives of disadvantaged students. 

While we cannot conclude that the search for effective classroom 
programs for at-risk students is over, the findings of this review do have 
important implications for current practice. It identified several programs 
that have demonstrated potential for increasing the achievement of disad- 
vantaged and low-achieving students and identified program elements that 
seem to characterize effective models. All the identified programs are 
designed to be replicable and transportable; in most cases the developers 
have training staffs who work to help school districts successfully implement 
the models. However, there are two notes of caution that practitioners 
should bear in mind. First, all of the effective models identified in this review 
are complex and are quite different from usual practice. Adequate 
provisions for training, followup, and monitoring of project implementations are 
imperative. Any of these programs can fail (and most have failed) when 
inadequate resources have been devoted to implementation. Program 
implementations should start on a small scale, beginning with volunteers and only 
gradually expanding from a solid base of success. Second, districts should 
conduct their own evaluations of new programs, using random assignment 
or careful matching to compare program users and nonusers. The fact that a 
program was effective elsewhere suggests but does not guarantee that it will 
be successful in any particular district. 
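
The second caution, that districts evaluate new programs using random assignment, can be illustrated with a short sketch. It is only a minimal example under stated assumptions: the function name `assign_conditions`, the student identifiers, and the fixed seed are hypothetical and do not come from any program reviewed here.

```python
import random


def assign_conditions(student_ids, seed=0):
    """Randomly split eligible students into program and comparison groups.

    Hypothetical helper for illustration only. A fixed seed makes the
    assignment reproducible so that it can be audited later.
    """
    rng = random.Random(seed)
    shuffled = list(student_ids)  # copy so the caller's list is left untouched
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return {"program": shuffled[:half], "comparison": shuffled[half:]}


# Six made-up identifiers standing in for eligible students.
groups = assign_conditions(["s01", "s02", "s03", "s04", "s05", "s06"])
```

Because chance alone decides who receives the program, later differences between the two groups are less likely to reflect how students were selected. Careful matching on pretest scores, the alternative named above, would need a different procedure.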

The search for reliably effective programs for students who are at risk 
for school failure is a task of great importance. This chapter and others in 
this book are intended to sum up where we are now, in the hope that we can 
learn from the past in helping direct future research, development, and 
evaluation into effective instruction for our neediest students. 



Effective Pullout Programs 



From the beginning, one of the guiding principles of compensatory 
education has been that compensatory services must supplement, not supplant, 
the educational programs provided to eligible students. Low-achieving 
students in schools that are eligible for Chapter I (formerly Title I) services 
must receive something identifiably "extra." Evaluators located in every 
state check to see that Chapter I services are in fact supplementing rather 
than supplanting regular education programs. 

Partly because of the "supplement, not supplant" regulations, schools 
have overwhelmingly relied on pullout models as a means of providing 
Chapter I or Title I services. Most often, students who qualify for 
compensatory services are taken out of their regular classrooms for thirty to forty 
minutes of remedial instruction in reading and/or mathematics. This 
arrangement has the advantage of making it clear that Chapter 1 services are 
supplementary, as special personnel and materials are clearly allocated only 
to identified students. In in-class alternatives to pullout, maintaining the 
distinction between who is served and who is not served is more difficult. At 
least partly for this reason, Chapter I programs have overwhelmingly 
chosen pullout as the mode of service delivery. A study conducted during 
the 1981-82 school year found that pullout outnumbered in-class models by 
nine to one (Advanced Technology, 1983), and in a more recent study of 
schools specially chosen to represent a variety of service delivery models, 
fifteen of seventeen elementary schools used pullout in reading and/or math 
(Rowan, Guthrie, Lee, and Guthrie, 1986). 

Despite the many criticisms of pullout (e.g., see Glass and Smith, 1977; 
Johnston, Allington, and Afflerbach, 1985; Archambault, 1987), pullout is 
likely to remain as a widely used means of providing compensatory 
education services. In any case, as many have noted previously (e.g., 
Archambault, 1987; Kennedy, Birman, and Demaline, 1986), the important 
issue is not the setting in which compensatory services are provided, but the 
quality of the programs provided in the setting. This chapter reviews 
research on effective pullout programs for elementary students who are at 
risk for school failure. 



SCOPE OF REVIEW 

The focus of this chapter is on programs provided to students who have been 
identified as being in need of remedial services that are implemented outside 
of the regular classroom. This excludes programs implemented in the regular 
class (which are reviewed in Chapter 2) and self-contained "replacement" 
or special education programs in which students are assigned to a remedial 
class for most or all of their school day. The emphasis of this review is on 
programs that could be (or have been) readily replicated by schools other 
than those that developed them. For example, many school districts have 
developed exemplary Chapter I programs (see Griswold, Cotton, and 
Hansen, 1986), but the features of these programs are often uniquely 
adapted to the situations, personnel, and students of the district and were 
not designed to be replicated as such in other districts. Also, a few studies 
have identified variables related to effective Chapter 1 programs (see 
Chapter 9; Cooley and Leinhardt, 1980). In contrast to these, the programs 
emphasized in this review are models that have well-specified manuals, 
materials, training procedures, and other features that characterize pro- 
grams intended for replication by others. 



PROCEDURES 

The procedures used in this review are the same as those described in 
Chapter 2. Essentially, the review procedure was an adaptation of 
best-evidence synthesis (Slavin, 1986), a method that combines the features of 
meta-analytic and traditional narrative reviews. The initial literature search 
examined many sources, including published and unpublished articles, 
school district reports, and government documents. Requests for 
information were sent to all of the 116 exemplary compensatory education programs 
identified by Griswold, Cotton, and Hansen (1986). However, as in Chapter 
2, the most useful sources of information were reports submitted to the Joint 
Dissemination Review Panel (JDRP), a U.S. Department of Education panel 
that reviews evaluations of programs supported by federal funds. 

The same set of substantive and methodological inclusion criteria as 
were used in Chapter 2 were applied in deciding which programs to 
emphasize in this review. In brief, programs had to assess effects on 
standardized reading and/or math scales in studies of at least a semester's 
duration. Programs had to be compared to matched or randomized control 
groups, or year-to-year gains of at least 7 NCEs had to be reported 
(fall-to-spring NCE gains, which have been found to have serious 
methodological problems, were not considered adequate evidence of program 
effects; see Gabriel, Anderson, Benson, Gordon, Hill, Pfannenstiel, and 
Stonehill, 1985). All programs presented data to indicate that they were 
effective with students eligible for Chapter 1 services or other at-risk 
students. 
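
The inclusion criteria just listed can be restated as a simple screening rule. The sketch below is illustrative only: the `Evaluation` record, its field names, and the `meets_criteria` function are assumptions introduced here, not part of the review procedure itself.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Evaluation:
    """Hypothetical record of one program evaluation (all fields are assumptions)."""
    standardized_measure: bool   # standardized reading and/or math scale used
    semesters: float             # study duration, in semesters
    control_group: bool          # matched or randomized control group
    year_to_year_nce_gain: Optional[float] = None
    at_risk_sample: bool = True  # Chapter 1 eligible or other at-risk students


def meets_criteria(ev: Evaluation) -> bool:
    """Apply the criteria summarized in the text to one evaluation."""
    if not (ev.standardized_measure and ev.semesters >= 1 and ev.at_risk_sample):
        return False
    if ev.control_group:
        return True
    # Without a control group, only year-to-year gains of at least 7 NCEs count.
    # Fall-to-spring gains are deliberately not represented in this record.
    return ev.year_to_year_nce_gain is not None and ev.year_to_year_nce_gain >= 7
```

For example, under these assumptions a year-long evaluation with no control group but a year-to-year gain of 8.7 NCEs would pass, whereas the same design with a gain of 5 NCEs would not.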

CATEGORIES OF EFFECTIVE 
PULLOUT MODELS 

The effective pullout programs for students at risk of school failure fell into 
three broad categories. One, diagnostic-prescriptive programs, covers the 
great majority of existing Chapter I programs. In this model, students 
identified as being in need of remedial services are carefully assessed and 
then instruction appropriate to their needs is given by a teacher in a location 
separate from the regular classroom (or occasionally within the classroom 
setting). Instruction may be given to individuals or to small groups within a 
pullout class of roughly three to eight students. The second category is 
tutoring programs, in which tutors work one-on-one with identified tutees. 
Tutors may be teachers, paraprofessionals, volunteers, or older students. In 
the third category, computer-assisted instruction (CAI), students work on 
computers for at least part of their remedial reading or math time. It should 
be noted that many diagnostic-prescriptive programs use computers for 
management (as opposed to instructional) purposes, but these are not 
categorized as CAI. 

CHARACTERISTICS AND OUTCOMES OF 
EFFECTIVE PULLOUT MODELS 

Diagnostic-Prescriptive Programs 

Because of the widespread use of diagnostic-prescriptive pullout programs, 
the data available on such programs is extensive. Almost all of the 
exemplary Chapter 1 programs identified by Griswold, Cotton, and Hansen 
(1986) used diagnostic-prescriptive models, as did many of the programs 
certified by the JDRP. However, very few of these presented convincing 
evidence of effectiveness. In most cases, these programs were identified as 
exemplary on the basis of fall-to-spring gains in normal curve equivalent 
scores (NCEs). However, most districts presented fall and spring scores for 
several years, and with few exceptions these indicate little or no growth in 
fall scores or spring scores over time. The typical pattern is a fall-to-spring 
gain of eight to twelve points, followed by a decline over the summer of the 
same magnitude. This cannot be seen as convincing evidence that the 
programs are not effective; it could be that the lack of year-to-year growth in 
scores is due to dropping of students with high scores from the Chapter I 
lists, or that scores increased when new programs were introduced but then 
remained stable during the years for which data were provided, or there may 
be other methodological or substantive factors involved. 

However, there are a few diagnostic-prescriptive programs that do 
present more convincing evidence of effectiveness. Information on these 
programs is provided in Table 3-1. 

Only two of the successful diagnostic-prescriptive programs used 
control group designs. One of these, Project Conquest (JDRP No. 74-12), 
mostly provides remedial services to groups of six students. However, early 
in the year, students receive one-to-one tutoring until they acquire word 
perception skills. Students are carefully assessed by special "reading 
clinicians" and given individual prescriptions. 

An evaluation of Project Conquest was conducted in low-income, 
mostly black schools in East St. Louis, Illinois. Project Conquest students 
made greater gains than control students at grade levels 1-6 on many 
standardized reading measures. However, these data are difficult to interpret 
because no evidence is given that the experimental and control classes were 
initially equivalent. A more sophisticated analysis was conducted using 
fifth-grade data and correcting for pretest differences, and this also showed a 
clear advantage for the program. 

One of the exemplary Chapter I programs identified by Griswold, 
Cotton, and Hansen (1986) used a control group design to evaluate its 
diagnostic-prescriptive pullout model. This is the Oklahoma City, Okla- 
homa, Chapter I program, a classic diagnostic-prescriptive pullout model. In 
this program, students are assessed in terms of skills and learning styles and 
then given instruction appropriate to their needs, individually or in small 
groups. Some use is made of computer-assisted instruction; the evaluation 
of this component is discussed later in the section on CAI. 

The evaluation of the Oklahoma City diagnostic-prescriptive pullout 
program (Kimball, Crawford, and Raia, 1985) involved careful matching on 
prescores of students who received Chapter I services with those who did 
not. The same procedures were followed in two successive years. Results 
indicated that Chapter I students gained an average of 3 NCEs more in math 
than their counterparts who did not receive Chapter I services. Using the 
standard deviation of NCEs of about 12 (estimated by Gabriel, Cotton, and 
Hansen, 1985), this is equivalent to an effect size of +0.25. However, gains 
in reading were much smaller and, in one year, were not statistically 
significant (ES = +.12). 
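
The conversion from NCE points to an effect size used above is a simple division by the standard deviation. The sketch below illustrates that arithmetic; the function name `nce_effect_size` is an assumption made here, and the default standard deviation of 12 is the approximate figure cited in the text, not a general constant.

```python
def nce_effect_size(nce_difference, nce_sd=12.0):
    """Convert a difference in NCE gains into an effect size (SD units).

    Hypothetical helper; nce_sd defaults to the approximate standard
    deviation of about 12 NCEs cited in the text.
    """
    if nce_sd <= 0:
        raise ValueError("nce_sd must be positive")
    return nce_difference / nce_sd


# The 3-NCE advantage in math reported for the Oklahoma City program.
math_es = nce_effect_size(3)  # 3 / 12 = 0.25
```

A different assumed standard deviation would change the result proportionally, so the effect size is only as firm as the standard deviation estimate behind it.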



TABLE 3-1. Diagnostic-Prescriptive Pullout Programs 

I. Programs Evaluated Using Control Group Designs 

Project Conquest (JDRP No. 74-12) 
    Grades/Subjects: 1-6; Reading 
    Description: Individual and small-group instruction provided outside of 
        class to remediate specific deficits identified by individual 
        assessment. Extensive inservice. 
    Evaluation: Students in East St. Louis, IL, compared to "comparable" 
        group. Gains significantly higher in Conquest schools. 
    Effect Sizes: Gr. 1-6: Cannot be estimated 

Oklahoma City, OK, Chapter 1 Program (Kimball, Crawford, and Raia, 1985) 
    Grades/Subjects: 1-6; Reading, Math 
    Description: District-wide management objectives used to coordinate 
        regular class and Chapter 1 objectives. 
    Evaluation: Chapter 1 students compared to matched unserved students. 
    Effect Sizes: Gr. 2-6: CAT (reading) +.25; MAT (math) +.12 

II. Programs Evaluated Using Year-to-Year Gains 

Lincoln, NE, Chapter 1 Management and Coordination Project 
    Grades/Subjects: K-6; Reading, Math 
    Description: District-level objectives and monitoring system managed by 
        computer; emphasis on day-to-day coordination between Chapter 1 
        and regular class. 
    Evaluation: Spring-to-spring gains of 4-5 NCEs in reading, 5-9 NCEs in 
        math (Gr. 1-6). 

Diagnostic-Prescriptive Arithmetic Program 
    Grades/Subjects: 3-5; Math 
    Description: Math lab approach emphasizes hands-on experience, 
        manipulatives. 
    Evaluation: Spring-to-spring gains of 10.4 NCEs on Stanford Diagnostic 
        Arithmetic Test (Gr. 3-5, students 2 yr. or more below grade level). 

Columbia, MO, Chapter 1 Math Program 
    Grades/Subjects: 2-6; Math 
    Description: Program emphasizes coordination between Chapter 1 and 
        regular class, use of manipulatives. 
    Evaluation: Fall-to-fall gains of 8.7 NCEs on Iowa Test of Basic Skills 
        (Gr. 2-6). 

The Baltimore, Maryland, school system also conducted a comparison 
of students served by Chapter 1 pullout programs and matched students 
eligible for Chapter 1 but not served. This evaluation found that served 
and unserved students made about the same fall-to-spring gains in reading 
and math scores (Granick, Quigley, Katzenellenbogen, and Richardson, 



Other than Project Conquest, only one JDRP-approved program 
appears in Table 3-1: the Diagnostic-Prescriptive Arithmetic Program 
(JDRP No. 74-68). This model, developed and evaluated in Staten Island, 
New York, uses a math lab approach to remediation of deficits in 
mathematics. Individualized and small-group activities keyed to problem areas 
identified by the Stanford Diagnostic Mathematics Test are provided to 
students. Spring-to-spring gains of approximately 10.4 NCEs were made; 
however, since the Stanford Diagnostic Mathematics Test was also used as 
the posttest, there is a possibility that this program was essentially teaching 
the test. 

Two additional programs from the list of effective Chapter 1 programs 
also presented evidence of effectiveness based on outstanding year-to-year 
gains. One of these is the Lincoln, Nebraska, Chapter 1 program. In this 
program, a computer management system was introduced to help handle 
diagnostic tests, assign students to the program, maintain coordination 
between the regular teacher and the Chapter 1 teacher, monitor student 
progress, and evaluate student success. During the years when the program 
was implemented, Chapter 1 students showed steady gains in spring scores 
in reading and math, which appear to be due to the addition of the computer 
management system rather than to the diagnostic-prescriptive pullout model 
already in use in the district (Weatherl, 1986). 

Another apparently effective Chapter I model cited by Griswold, 
Cotton, and Hansen (1986) is the Columbia, Missouri, Public Schools' 
Chapter I mathematics program. This model places considerable 
emphasis on coordination of instruction between Chapter 1 and regular 
classroom teachers; forms indicating specific objectives students are working 
on are passed back and forth between Chapter I pullout teachers and 
regular classroom teachers, and time is set aside for Chapter 1 and regular 
teachers to meet. Program data indicate fall-to-fall gains of 8.7 NCEs in 
math. 

The evidence from the Chapter I programs listed in Table 3-1 indicates 
that diagnostic-prescriptive programs can be effective, but their effects do 
not generally appear to be large. However, the almost universal use of 
fall-to-spring gains (or year-to-year gains at best) as the criteria of program 
effects means that we really know little about which programs (or com- 
ponents of programs) are having important effects on the students they 
serve. 

Tutoring Programs 



A wide variety of tutorial programs using tutors ranging from experienced, 
specially trained teachers to paraprofessional aides to older, often 
low-achieving students have demonstrated considerable effectiveness in 
improving students' achievement in reading and math in grades 1-6. These effective 
programs fall essentially into two groups: those that were designed as 
remedial programs and those that were designed as preventative programs. 

Remedial Tutorial Programs. Three models of remedial uses of 
tutoring were identified. Descriptions of them appear in Table 3-2. The two 
strongest models, Training for Turnabout Volunteers (TTV) and School 
Volunteer Development Project (SVDP), were developed in Dade County, 
Florida. TTV used volunteer junior high school students who took tutoring 
as an elective class to tutor low-achieving first through sixth graders in 
reading and math. Tutors were required to be able to read and compute at 
the fifth-grade level, which allowed underachieving students to act as tutors. 
Tutors received considerable training involving a specified structured cur- 
riculum before they began to tutor, and spent one day out of five in group 
supervision, receiving continuing training in the specific subject matter being 
taught as well as in tutoring skills such as rewarding the tutees' successes, 
refraining from criticizing failures, organizing the work to be presented, and 
so on. The tutorial materials were not programmed. Tutors drew from a wide 
variety of materials deemed to be useful for teaching the needed skills. The 
gains made by students working with the well-trained tutors were compared 
to the gains made by students working with tutors who did not receive 
continuing supervision, but otherwise worked for similar amounts of time 
with similar students using the same kinds of materials. Students worked 
together four days a week for sixteen weeks for forty minutes a day. 

Gains were significantly greater for both tutors and tutees when the 
tutors received continuing training. Students tutored by trained tutors in 
math gained 0.93 standard deviations more than those tutored by untrained 
tutors on the Metropolitan Achievement Test. In reading, the tutees of 
trained tutors gained 0.51 standard deviations more on the Metropolitan than 
their comparison group. Trained tutors gained 0.49 standard deviations more 
than their untrained counterparts in math, but did not gain significantly more 
in reading. 

One difficulty with the TTV program was that it required that an 
elementary and junior high school be physically close to one another so that 
students could travel back and forth in the time allotted for one class period 
and still have time to tutor. The second program developed in Dade County 
avoided this difficulty by focusing on the recruitment and training of adult 
volunteers as tutors. In the School Volunteer Development Project, adult 
volunteers tutored underachieving first through sixth graders for a half hour 
a day four or five days a week. Tutors were trained prior to tutoring in a 
variety of tutoring skills and use of multimedia materials, and worked with 
the reading specialist on the skills they were tutoring. Again, a variety of 
materials pertaining to the skills being developed were used. Students were 
randomly assigned to tutored or untutored groups. Those who received 
tutoring gained 0.50 standard deviations more in reading and 1.1 standard 
deviations more in math than the untutored control students. 

TABLE 3-2. Remedial Tutoring Programs 

I. Programs Evaluated Using Control Group Designs 

Training for Turnabout Volunteers (JDRP No. 8MI) 
    Grades/Subjects: Tutors 7-9, Tutees 1-6; Reading, Math 
    Description: Cross-age tutoring. Tutors trained with preservice and 
        inservice classes. Not programmed material. Tutored 40 min/day, 
        4 days/week. Inservice training on fifth day. 
    Evaluation: One-year study in Miami, FL, compared trained vs. untrained 
        tutors. Nonequivalent but unbiased groups. Five schools in study. 
    Effect Sizes: Tutees 1-6, tutors 7-9 
        Tutees: MAT (math) +.93; MAT (reading) +.51 
        Tutors: MAT (math) +.49; MAT (reading) n.s. 

School Volunteer Development Project (JDRP No. 75-79) 
    Grades/Subjects: 2-6; Reading, Math 
    Description: Community volunteer tutors, 2-4 hrs/week/volunteer. Each 
        student tutored 1/2 hr. 4 days/week minimum. 
    Evaluation: Students in Miami, FL, randomly assigned to tutored or 
        nontutored conditions for one school year. 
    Effect Sizes: Gr. 2-6: MAT (reading) +.50; MAT (math) +1.1 

Success Controlled Optimal Reading Experience (SCORE) (JDRP No. 80-42) 
    Grades/Subjects: 1-6; Reading, Math 
    Description: Each student tutored 15 min/day by older students or adult 
        volunteers. 
    Evaluation: Students in San Francisco, CA, randomly assigned to tutored, 
        control conditions; 2 controls replicated over 3 yrs. No tests of 
        reading comprehension; effects are on word recognition only. 
    Effect Sizes: WRAT, Gilmore: +.5 to +.7 

In contrast to these two programs, Success Controlled Optimal 
Reading Experience (SCORE), the third successful remedial model iden- 
tified, uses highly programmed materials and a very specifically structured 
tutoring session. SCORE uses rapid drill and practice in lists of words 
grouped to teach specific decoding skills (short and long vowel sounds, 
sound blending, word patterns such as night, light, fight, and so on). 
Students who are deficient in decoding skills are tutored for fifteen minutes a 
day until they complete the program, usually a period from four to six 
months. 

In the original evaluation of the program, educationally handicapped 
students were randomly assigned to a special class for learning-disabled 
students, to regular class placement with no support, or to regular class 
placement with fifteen minutes of tutoring by students in grades 6 to 12. The 
SCORE treatment group exceeded both control groups on measures of word 
recognition and oral reading accuracy. Effect size estimates ranged from 0.5 
to 0.7 standard deviations. The results were replicated over three successive 
years. The program has been implemented in a wide variety of settings with 
lower middle- and upper-class populations in urban and rural settings using 
older students, parent volunteers, and aides as tutors. Gains in these settings 
have exceeded the gains in the original study in almost every case. 

A limitation of these data lies in the fact that no reading comprehension 
data are provided. The gains seen in word recognition and oral reading 
accuracy are large and important, and would be expected to enable readers 
to pay more attention to comprehending their reading, but there is no 
guarantee that this generalization occurs. Comprehension gains would have 
to be assessed in any implementation of this program. It may be that SCORE 
would provide a strong first step in a more complete program that explicitly 
addressed comprehension as well. 

Preventative Tutoring Programs. Four different programs have 
addressed the problem of reading failure by attempting to prevent failure at the 
first-grade level. Descriptions of these programs appear in Table 3-3. 
Programmed Tutorial Reading (PTR) provides one-to-one tutoring by para- 
professionals to first graders in the bottom quartile in reading. The tutoring 
process is highly structured. The tutor uses programmed materials based on 
the basal series used in the school that instruct the tutor in where to start, 
what to say, when to praise, how to respond to a failure in word 
recognition, and so on. Students are essentially taught the words in the basal using a 
sight word approach, and in addition are taught word analysis and passage 
comprehension skills using the same vocabulary in separate series of 
lessons. 

TABLE 3-3. Preventative Tutoring Programs 

I. Programs Evaluated Using Control Group Designs 

Programmed Tutorial Reading (JDRP No. 74-17) 
    Grades/Subjects: 1 only; Reading 
    Description: Each student tutored 15 min/day by paraprofessionals or 
        older students. 
    Evaluation: Evaluation contained matched pairs of tutored/nontutored 
        students in Farmington, UT. No pretest differences. Effects 
        replicated using nonstandardized tests in several other districts. 
    Effect Sizes: Gr. 1, reading, bottom quartile 
        Vocabulary +.65 
        Comprehension +.41 
        Basal comprehension measure +.50 

Prevention of Learning Disabilities-New York (JDRP No. 79-33) 
    Grades/Subjects: 1-2; Reading 
    Description: Each student tutored 3-5 times/week for 30 min. by a 
        resource teacher. 
    Evaluation: Students randomly assigned to tutoring or control 
        treatments. Achievement measured 2nd grades. 
    Effect Sizes: Gr. 1 and 2, students with low readiness scores 
        Measure         Gr. 1    Gr. 2 
        WRAT            +.86     +1.06 
        Word Attack     +1.41    +1.67 
        Word Recog.     +.95     +.91 

Wallach Tutorial Program (Dorval, Wallach, & Wallach, 1978) 
    Grades/Subjects: 1 only; Reading 
    Description: Students tutored 30 min/day, 28 hrs. total. Tutored by 
        paraprofessionals. 
    Evaluation: Unclear how students were assigned to treatments but groups 
        were equivalent at pretest. Compared tutoring to no tutoring to 
        time with aide. 
    Effect Sizes: Gr. 1, students below 40% on MAT 
        Spache Word Recognition 1.3 GE 
        CTBS Total Reading .75 

Reading Recovery (Pinnell et al., 1986) 
    Grades/Subjects: 1 only; Reading 
    Description: Students tutored by specially trained teachers 30 min/day, 
        at least 60 days. 
    Evaluation: Study 1 compared lowest readers in 14 Columbus, OH, classes 
        to similar students in comparison classes. Study 2 randomly 
        assigned low readers to exp., control. 
    Effect Sizes: Grade 1, low readers 
        CTBS Rdg. Vocabulary, CTBS Rdg. Comprehension: .70, .92, 7'i 

Programmed Tutorial Reading has been evaluated against a control 
group in two studies. In the first (Ellson, Harris, and Barber, 1968), students 
were randomly assigned to receive PTR, directed tutoring, or no tutoring. 
Directed tutoring involved having paraprofessionals tutor students using
specific materials provided by the teacher that were designed to correspond 
to the instruction going on in the classroom. Tutors in both PTR and directed 
tutoring were trained for about eighteen hours prior to the beginning of
tutoring. Results indicated that PTR was more effective than directed tutoring
and control, but the differences were not large. 

In the second study of PTR (JDRP No. 74-17), students receiving
fifteen minutes of tutoring a day were compared to students receiving no 
tutoring over a one-year period. In this study, substantial differences were 
seen between the two groups in favor of the tutored group. 

Two other prevention models used programs designed to improve 
students' skills in specific reading or prereading areas. The Wallach Tutorial 
Program (Dorval, Wallach, and Wallach, 1978) used paraprofessionals to
tutor children identified as low in readiness skills for one-half hour per day 
on phoneme identification skills. This training focused the students on 
breaking up the sounds in words so that, for instance, beginning sounds 
could be heard, sounded, and represented separately from the rest of the 
word. Wallach and Wallach based their program on research showing that 
disadvantaged children were drastically deficient in recognizing phonemes in 
heard words— such as knowing whether house or man starts with the sound 
"mmmmmmm." The tutored children performed at the 56th percentile on 
the Comprehensive Tests of Basic Skills (CTBS) at posttest after having begun
at the 17th percentile. The control group moved from the 19th percentile to
the 35th.
The effect size for the comparison is 0.75 standard deviations. 
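
The effect sizes reported throughout this chapter express the difference between treatment and control group means in standard deviation units. As a minimal sketch, assuming the control group's standard deviation is used as the scaling unit (the chapter does not say which standard deviation the original evaluators used), the calculation might look like the following. The function name and the scores are hypothetical and serve only to illustrate the arithmetic; they are not data from any study reviewed here.

```python
from statistics import mean, stdev


def effect_size(treatment_scores, control_scores):
    """Standardized mean difference: (treatment mean - control mean)
    divided by the control group's sample standard deviation.

    Using the control-group SD is an assumption made for this sketch.
    """
    return (mean(treatment_scores) - mean(control_scores)) / stdev(control_scores)


# Hypothetical posttest scores, invented for illustration only.
tutored = [52, 58, 61, 47, 55, 63]
control = [44, 50, 53, 41, 49, 51]

es = effect_size(tutored, control)
print(f"Effect size: {es:+.2f} standard deviations")
```

An effect size of 0.75, as reported for the Wallach program, would mean that the average tutored student scored three-quarters of a standard deviation above the average control student.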

Prevention of Learning Disabilities, a program developed by the New 
York University Medical Center, takes an approach that is somewhat 
similar to the Wallach and Wallach approach. Students are screened for
deficits in sensory skills related to reading at the kindergarten or beginning
first-grade level, and those found to be deficient are tutored in those areas in
which they are low. Auditory discrimination exercises similar to those used
by Wallach and Wallach form a significant part of this instruction. Students
were assigned to work on a tutorial basis or in small groups (two to three
students) with a resource teacher three to five times a week. The program is
designed as a two-year program, but evaluations after both one and two 
years showed significant gains for the treated group on word recognition 
scales and on a word attack measure. No standardized tests of reading 
comprehension were used.

The most recently developed of the preventative tutoring models is
Reading Recovery, a program originally developed in New Zealand by Clay
(1985), and currently being adapted and evaluated in the United States by a
group at Ohio State University. In this model, first graders who are
diagnosed as having difficulties in reading work one-on-one with specially
trained tutors for thirty minutes a day. Students are released from tutoring
when they achieve a certain level on diagnostic tests that are part of the
program. Some proportion of children are released from tutoring at the end
of first grade without having reached the required level. On the average,
Reading Recovery children receive sixty to eighty lessons (Pinnell, Short,
Lyons, and Young, 1986). Reading Recovery instruction emphasizes having
students read aloud from minibooks and write their own stories. No
particular attempt is made to integrate tutoring with instruction students
may be receiving in their regular classes. Tutors, who are certified teachers,
are trained over the course of a full school year, with trainers observing
tutoring sessions from behind one-way mirrors and providing detailed 
feedback. 

Two cohorts of first graders have been compared to control groups. 
The 1984-85 cohort originally consisted of fifty-five Reading Recovery and
fifty-five matched control students, the eight lowest-achieving first graders in
seven experimental and seven control classes, respectively. 

As of this writing, these students have been followed through the end 
of the third grade (DeFord, Pinnell, Lyons, and Young, 1987). At the end of
the first grade, the Reading Recovery students scored considerably higher
than control students on the Stanford Reading Test (ES = +.605). Unfor-
tunately, the standardized tests were not repeated in subsequent years.
What was used longitudinally was an assessment of "text reading level" 
from the diagnostic test used in the Reading Recovery Program itself. On 
this test, effect sizes separating experimental and control students were as 
follows: 



Text Reading Level:

    Comparison                  Effect Size
    Pretest (Dec. '84)          +.16
    End of Gr. 1 (May '85)      +.87
    Fall, Gr. 2 (Oct. '85)      +.75
    End of Gr. 2 (May '86)      +.45
    End of Gr. 3 (May '87)      +.29

Clearly, Reading Recovery had a substantial positive effect on reading
performance at the end of grade 1, and this effect is still detectable (and
statistically significant) two years later. Not surprisingly, though, the effects 
diminished over time. In addition to the achievement effects, it is important 
to note that Reading Recovery students were significantly less likely than
control students to fail a grade; 66 percent of Reading Recovery students
were in third grade in 1986-87, compared to only 51 percent of control 
students. 
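
The decline in effect sizes described above can be made concrete by expressing each later measurement as a fraction of the end-of-grade-1 effect. The sketch below uses the text-reading-level effect sizes reported for the 1984-85 cohort; the function name is hypothetical, and treating the ratio of two effect sizes as "the share of the effect retained" is a simplifying assumption for illustration, not an analysis performed by the original evaluators.

```python
# Text-reading-level effect sizes for the 1984-85 cohort, as reported
# in the listing above (the pretest value is omitted).
effect_sizes = {
    "End of Gr. 1 (May '85)": 0.87,
    "Fall, Gr. 2 (Oct. '85)": 0.75,
    "End of Gr. 2 (May '86)": 0.45,
    "End of Gr. 3 (May '87)": 0.29,
}


def fraction_retained(sizes, baseline_label):
    """Return each effect size divided by the baseline effect size."""
    baseline = sizes[baseline_label]
    return {label: es / baseline for label, es in sizes.items()}


retained = fraction_retained(effect_sizes, "End of Gr. 1 (May '85)")
for label, fraction in retained.items():
    print(f"{label}: {fraction:.0%} of the end-of-grade-1 effect")
```

On this reckoning, roughly one-third of the end-of-grade-1 advantage in text reading level was still present two years later.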

A similar pattern of results was found for the 1985-86 cohort. By the
end of first grade, these students exceeded control on CTBS Reading
Vocabulary (ES = 0.67) and Reading Comprehension (ES = 0.61). In text
reading level, the longitudinal results were as follows:



Text Reading Level:

    Comparison                  Effect Size
    Pretest (Oct. '85)          +.19
    End of Gr. 1 (May '86)      +.97
    End of Gr. 2 (May '87)      +.65



A total of 81 percent of Reading Recovery students were promoted to 
the second grade, in comparison to 73 percent of control students. 

The results of the Reading Recovery studies indicate that this model 
can have substantial effects on the reading achievement of the first graders. 
Although the effects diminish over time, they are still important at the end of
third grade, even in the absence of any special programs in grades 2 and 3.

Overall, the preventative tutoring programs have great potential for 
giving students a good start in reading. Combined with less intensive 
instructional improvements in the later grades, these programs may provide 
a means of obviating the need for remedial or special education for all but the 
most retarded or disabled students (see Chapter 12).



Computer-Assisted Instruction 



Computer-assisted instruction (CAI) can be seen as another form of tutoring 
in which the tutor is a machine rather than a person. Like a human tutor, 
computers can determine students' needs, provide instruction appropriate to 
those needs, recognize and reinforce student success, and keep records of 
student progress. However, computers are usually less able to explain
concepts to students, and for this reason CAI programs invariably have
teachers available to students while they are working on their computer
lessons. Descriptions of effective CAI models appear in Table 3-4. 

Several studies of CAI as a pullout program for students in need of
remediation have been conducted, but both the largest number of evalu- 
ations and the highest quality evaluations have involved the reading and 
math programs developed at Stanford University in the early 1970s and 
currently disseminated by the Computer Curriculum Corporation (CCC)
(Jamison, Fletcher, Suppes, and Atkinson, 1976). Unlike most current CAI
programs, which operate on microcomputers, CCC uses a mainframe with
terminals linked to a central processing unit by telephone. The computer
keeps records of student performance levels and progress, and provides
students with exercises appropriate to their needs. 

The most important evaluation of the CCC curriculum itself was 
conducted in the Los Angeles Public Schools by the Educational Testing 
Service (Ragosta, 1983). In this four-year longitudinal study, students were 
randomly assigned to receive ten minutes per day of CAI in mathematics, 
reading, or language, as part of a thirty-minute Chapter I pullout period.
Results on the CTBS indicated substantial effects of CAI on math compu- 
tations, which increased from an effect size of 0.36 in the first year to 0.72 by 
the third year of intervention. Effects on concepts and applications scales
were smaller and nonsignificant. In reading, positive effects were found for 
vocabulary and comprehension scales after one year (ES = +.25 and +.23,
respectively), but while the vocabulary effects increased to + .59 by the third 
year, comprehension effects actually favored the control group by that time 
(ES = -.24). Effects in language mechanics were at about one-quarter of a 
standard deviation all three years, but were small and generally nonsignifi- 
cant on language expression and spelling scales. 

The JDRP-approved projects that used the CCC curriculum also
present convincing evidence of effectiveness. One, a program evaluated in 
Lafayette Parish, Louisiana, called Title I Mathematics Laboratory with 
Computer Assisted Instruction (JDRP No. 82-46), randomly assigned 
students to regular Title I pullout or to a combination of the regular pullout 
model with ten minutes of CAI in mathematics. Effects on the CTBS Total 
Math Scale were modest, only +.19, but the extraordinary quality of the
experiment makes the effect credible. Another JDRP program based on the 
CCC reading materials is the Merrimack (Massachusetts) Education Center 
(JDRP No. 82-34), a program that supplements ten minutes per day of CAI 
with tutorial and small-group instruction from the teacher. A study with 
random assignment of students to CAI and control treatments found positive 
effects averaging 40 percent of a standard deviation on the MAT reading 
scale. 

One interesting CAI program that does not use CCC materials is Basic
Literacy through Microcomputers, a model developed and evaluated in Salt 
Lake City, Utah. This program uses either electric typewriters or computers 
to supplement the teacher's instruction by giving students opportunities to 
practice and apply phonics skills by typing words, sentences, and stories. 
The evaluation of this program involved comparison classes in gains on 
standardized tests. Differences at grades 1 and 3 favored the experimental
group, but substantial pretest differences at grade 3 make these results
inconclusive. Ironically, the grade 1 experiment, which found substan-
tial positive effects on California Achievement Test Total Reading scores
(ES = +.58), used typewriters rather than computers. 

TABLE 3-4. Computer Assisted Instruction Programs

I. Programs Evaluated Using Control Group Designs

Program and Source: Computer Curriculum Corp.
Description: Students engage in drill and practice activities on
    computers 10 min/day in addition to regular instruction.

  Study 1 (Ragosta, 1983)
  Grades/Subjects: 1-6 Math; 3-6 Reading, Language
  Evaluation: Comparison with randomly selected controls. Groups
      studied longitudinally over four years in Los Angeles Unified
      School District, 4 schools.
  Effect Sizes (Gr. 1-6 Math, 3-6 Rdg., 3-6 Lang.; CTBS):
                      Yr 1     Yr 2     Yr 3
      Math   Comp     +.36     +.56     +.72
             Conc     -.02     +.12     +.09
             Appl     +.03     +.12     +.26
      Rdg    Voc      +.25     +.17     +.58
             Comp     +.23     -.01     -.24
      Lang   Spel     +.14     +.05     +.14
             Mech     +.22     +.27     +.25
             Expr     +.11     +.05     +.23

  Study 2: Lafayette Parish Title I Math (JDRP #82-46)
  Grades/Subjects: 3-6; Math (Chapter I)
  Evaluation: Compared to students receiving Chapter I pullout.
      Students randomly assigned in Lafayette Parish, LA.
  Effect Sizes (Gr. 3-6 Math):
      CTBS Total Math          +.19

  Study 3: Merrimack Education Center (JDRP #82-34)
  Grades/Subjects: 2-9; Reading (Chapter I)
  Evaluation: Students randomly assigned in Merrimack, MA. Compared
      to students receiving Chapter I pullout.
  Effect Sizes (Gr. 2-9 Rdg.):
      MAT Reading              +.40

Program and Source: Basic Literacy Through Microcomputers
    (JDRP #84-14)
Grades/Subjects: 1-3; Reading (Chapter I)
Description: Microcomputers or typewriters used to practice phonics
    and writing skills in addition to regular instruction.
Evaluation: Students randomly assigned. Compared to students
    receiving no additional time.
Effect Sizes (Gr. 1):
    CAT Reading                +.58

Overall, results for the CAI programs (especially CCC) are well
established and positive, though in the best-controlled studies they are
usually modest in magnitude and appear more frequently on basic skills than
on higher-order skills. Since the costs of CAI can be very high (see Ragosta,
1983), this approach can be compared to adult tutoring, which tends to have
larger effects in studies of similar methodological quality.



CONCLUSIONS 

Chapter 2 reviewed research on effective classroom programs for students at 
risk for school failure. The review concludes that the most consistently 
successful classroom models were continuous-progress programs in which 
students are taught in skill-level groups and proceed through a hierarchical
set of skills, and cooperative learning programs in which students also 
receive instruction at their appropriate levels but then practice skills in 
mixed-ability learning teams. On the basis of this and other evidence,
Chapter 2 concluded that effective programs for students at risk balanced 
adjustment of instructional approaches to meet students' unique needs with 
provision of adequate direct instruction. In addition, effective classroom 
programs provide frequent assessment of student progress through a well- 
specified, hierarchical set of skills. 

The examination of effective pullout programs conducted in this
chapter provided further support for these conclusions. The most successful 
models, tutoring and CAI, completely adapt instruction to students' unique 
needs and provide plentiful direct instruction appropriate to students' levels
of readiness. Diagnostic-prescriptive pullout models, which have generally 
been less successful than tutoring or CAI, also carefully assess students' 
needs and adapt instruction to these needs, but often suffer from one of two
problems. Chapter I pullout teachers often work with individual students, 
leaving others in the group to spend much time working on worksheets, 
which may be of relatively little value. Alternatively, Chapter I teachers 
may present lessons to heterogeneous groups of students that are poorly
adapted to their individual needs, and may be poorly integrated with
instruction being provided in the regular class (see Chapter 11). In either
case, the instruction provided in traditional diagnostic-prescriptive pullout 
programs may not be markedly better than that provided in the regular 
classroom. If so, it is unrealistic to expect that thirty to forty-five extra 
minutes of instruction will make a substantial difference in achievement. 
Both the tutoring and the CAI studies suggest that intensive interventions 
are needed to make a substantial difference in a pullout program. 

Taken together, the conclusions of this chapter and Chapter 2 suggest 
that the achievement of at-risk students can be significantly increased, either
by making relatively inexpensive but extensive modifications in the regular
instructional program or by implementing relatively expensive but intensive
interventions as pullout programs. It is possible that a combination of these
strategies would be more effective than either one by itself.

There is much more we need to know about effective programs for 
students at risk. Research on diagnostic-prescriptive models is generally of 
low quality; it may be that certain forms of diagnostic-prescriptive models 
would be highly effective in well-designed experiments, but much research 
and development is needed to establish what these forms might be and how 
much difference they could make in student achievement. The tutoring and 
CAI studies are of much higher methodological quality, but except in cases
where peer tutoring or volunteer programs are practical, these are expensive
interventions. For practical as well as theoretical reasons, then, there is a
need to identify the elements of tutoring and CAI that account for their
effects, so that perhaps these same principles could be applied in a less 
expensive form. In addition, the cost-effectiveness of alternative approaches 
to prevention and remediation remains to be determined. For example, it is 
possible that very expensive but very effective preventative tutoring models 
may, by reducing retentions, remedial needs, and special education refer-
rals, be cost-effective. Even if these preventative programs could not
completely justify themselves on a dollar basis, however, they could help
avoid the human cost of retention and special education.

Early childhood education has traditionally been aligned with social and
educational reform efforts (Lazar and Darlington, 1982). From the first
pioneers who championed early education for the poor (e.g., Comenius,
Pestalozzi, and Montessori) to present-day advocates of quality preschool
programs for students "at risk," the connection between early education
and social intervention has clearly been made. For example, in the United
States the early efforts of the Peabody sisters in Massachusetts to provide
early education for children of poverty were motivated by the desire to
better the future lives of these children. More recently, the establishment of
the Head Start program as a part of President Johnson's War on Poverty 
provides evidence of the connection between early education and social 
reform. 

Despite the existence of this long-term connection between preschool 
and social reform, actual support for preschool has been intermittent, being 
picked up and abandoned as a function of economic, social, and political 
forces. For example, during the depression and World War II, the federal 
government sponsored nursery schools, but had little other involvement 
until the mid-1960s. The spotty history is in part due to the fact that
preschools are operated by multiple sponsors, including churches, public
schools, private schools, and profit and not-for-profit day care centers. 
Given the different auspices, regulations and governing agencies vary 
widely. Consequently, no regularized, institutionalized system guarantees 
preschool for the disadvantaged or for any other group. Head Start, and 
programs run under Chapter I funds, have existed for more than twenty
years, but are constantly in jeopardy and have never served a large
proportion of the eligible students. (See Chapter 6 for a discussion of the
history of Head Start.) At present, preschool has no legal, provisional, or
guaranteed financial foundation. Part of the issue surrounding preschool is
the question of its legal and financial status within the education system.
Although such questions are inherently political and social, issues of
educational benefit are clearly relevant in this debate.

A discussion of the evidence regarding the short-term and long-term
effects of preschool is therefore important and especially timely. What
does research indicate about the effectiveness of preschool as an interven-
tion strategy? What is the evidence about the effects of particular programs
for students at risk? Finally, what are some of the larger issues that need to be
considered as states and locales move to include preschools in their
elementary schools?

This chapter reviews the major literature on the effects of preschool. 
Particular attention is given to the question of the effects of specific 
curricular models and programs, including programs that have been certified 
as effective by the U.S. Department of Education's Joint Dissemination 
Review Panel (JDRP). 



THE CHANGING FACE OF PRESCHOOL 

The term preschool covers a wide variety of early educational experiences. 
It may mean a half-day nursery school program that emphasizes play and 
socialization, or it may mean a full-day academic program where the 
intention is to teach reading and math readiness skills. It may mean 
programs for four-year-olds, three-year-olds, or even earlier intervention
efforts. In this chapter, we focus primarily on prekindergarten programs for
four-year-olds, although a few studies included three- and five-year-olds as 
well. 

The first important change in preschool programs is the very rapid rate
of growth over the last twenty years. While only 15 percent of all four-year-
olds were enrolled in 1964, almost half of all four-year-olds are presently
enrolled in preschool (Center for Education Statistics, 1986, fig. 1).

A second important change is the auspices for preschool. There is a 
growing involvement of public schools in the provision of preschool 
programs. State education agencies in fifteen states and the District of 
Columbia currently fund prekindergarten programs for four-year-olds
(Morado, 1985). State-funded programs are operated either by permissive 
language in the state's school code or by special legislative provision for 
programs. One example of permissive language is the provision for
preschool in Texas, where the legislation reads "Any school district may offer
prekindergarten classes, but a district shall offer prekindergarten classes if
the district identifies 15 or more eligible children." The bill defines as eligible
those four-year-olds who are either "unable to speak and comprehend the
English language" or are "from a family whose income ... is at or below
subsistence level" (Schweinhart, 1986, p. 35).

The increase in provision of public preschool will most likely result in
different attendance patterns for different income and ethnic groups. Recent
statistics on preschool enrollment indicate that among the four-year-olds
enrolled in preschool in 1984, there are already differences in patterns of
enrollment by race and income. Whites are more likely to be enrolled in private
preschools than are blacks (32.9 versus 16.2 percent). Income level is an
important determiner of enrollment patterns. For families with incomes less
than $20,000, a greater proportion of black four-year-olds are enrolled in
preschool than are white, and this difference is primarily due to differences
in public school enrollment. For families with incomes greater than $20,000,
a greater percentage of white children participate in preschool than do black,
and this again comes about primarily due to differences in auspices, with
43.6 percent of the whites attending private preschool.

Between 1975 and 1984, the greatest growth in preschool attendance
was among high-income white children with mothers not in the labor force.
In 1975, 53.4 percent of white four-year-olds in families with income of
$15,000 and over were in preschool. In 1984, 65.9 percent of white
four-year-olds in families with income over $30,000 were in preschool. The
percent of enrolled four-year-olds of nonworking mothers rose from 37.5
percent in 1975 to 48.2 percent in 1984.

These figures suggest that there may be two very different trends 
developing in the expansion of early education. One force is the continuing
demand for compensatory programs, as witnessed by the state efforts at
programs for the disadvantaged. The other growth is in the private sector for
programs that primarily serve high-income families with nonworking moth-
ers and mothers who are employed part-time. Private preschool appears to
be growing at a more rapid rate than public preschool. This uneven growth
in preschool opportunities creates greater inequality in preparation for
school.

A third area of change is the singular view of the significance of early 
childhood experiences for later development. The empirical groundwork
suggesting the importance of environment on development was started by
studies of the Iowa schools in the 1930s and 40s. Although the studies lacked
adequate controls and experimental designs, the publication of results
helped contribute to a sense that intervention strategies aimed at early ages
were particularly effective in overcoming effects of deprivation.

Kirk (1958) carried out the first properly controlled experiments on the
effectiveness of early intervention. He conducted a two-year intervention in
which the IQ of the treatment group was significantly raised. Eventually,
however, these IQ gains washed out, and students who had benefitted from
the early intervention program were indistinguishable from those who
had not.

Longitudinal studies of orphans/adoptees (Skeels, 1966) were also
influential in building the view of the critical importance of early life
experiences. Twenty-five infant residents of Iowa Soldier's Orphan Homes
who were previously not offered for early adoption because of adverse
social histories were divided into treatment and control conditions. Thirteen
treatment children were transferred to an institution where active, stimulat-
ing efforts were made to make them more adoptable (i.e., to increase their
apparent IQ). Twelve stayed at Orphan's Home without intervention.

In adult life the contrasts between these two groups were stark. The
experimental group on the whole were normal, well-adjusted members of
society, whereas the control group was not. This study reinforced the notion that
early intervention could have an important impact on adult social and 
educational status. 

The early 1960s saw an increased focus on early childhood as a prime 
time for intervention. Part of the reasoning was that because infancy is a 
period of unusually rapid maturation and sensitivity, intervention at this
point was particularly critical. Bloom (1964), in Stability and Change in
Human Characteristics, states the prevailing beliefs about early develop-
ment and intervention as:

In terms of intelligence measured at age 17, about 50% of the development
takes place between conception and age 4, 30% between 4 and 8, 20% between
8 and 17. The evidence so far available suggests that marked changes in
environment in the early years can produce greater changes in intelligence than
will equally marked changes in the environment at later periods of develop-
ment.

Thus the prevailing belief in the mid-1960s was that a single, restricted, 
and essentially modest intervention could have life-long consequences. As 
Elkind states, "Great expectations and promises were based on the view
that the young child was plastic material to be molded quickly and 
permanently by the proper school environment." The early childhood 
intervention programs of this era were conceived in a period of great 
optimism and equally great naivete about the possibility of the strength of 
early intervention. 

Some of the enthusiasm was diminished by early evaluations of Head
Start and by statements that "we have tried compensatory education and it
has failed" (Jensen, 1969). Yet, of the interventions born in this era, the
long-term positive effects recently documented are altering this gloomy
picture (Berrueta-Clement, Schweinhart, Barnett, Epstein, and Weikart,
1984).

Accompanying this shift in the view of environment on development 
was an important redefinition of the role of early cognitive experiences. 
Prior to the 1960s, there was actually very little concern with the issue of 
early intellectual functioning. Instead, the concern was with the develop- 
ment of the healthy personality of the child, which if properly taken care of 
would automatically include intellectual maturity as well. 

When this "normative maturational" view of early development was in
vogue, there was little interest or concern over providing appropriate
learning environments, since satisfactory intellectual development was seen
as primarily maturational, not environmental. After the late 1960s and into
the present, this view of childhood has given way to what Elkind (1986) calls 
the "competent infant" point of view. 

The competent infant movement has two important assumptions:
(1) malleability of traits such as IQ and (2) singular importance of early
intervention to prevent later impoverishment in intellectual functioning. The
latter issue points directly to the need for appropriate early childhood
education programs both as an intervention strategy for the disadvantaged
and as an appropriate educational step for advantaged students as well. 

With these changes in the views of children and with ever increasing 
numbers of women in the labor force, it is not surprising that the area of 
early education has become an area of great interest in the 1980s. Many 
states and local districts are buying into the idea, but there are some
concerns being voiced. For example, some black citizens note that the
school system has failed them already and question why the public schools
should have any better a record in early education than they do in
elementary and secondary education. Others note the difficulty in funding
programs at a sufficiently high level to duplicate the quality programs that
served as an impetus for implementing preschool programs in the first place
(Grubb, 1987). Still others are concerned that the preschools are being
implemented to help alleviate the need for child care and question why the
public schools should foot the bill for these services. Therefore, many
social, political, and educational issues are involved in present discussions
about preschool. Nonetheless, the educational benefits of preschool pro-
grams should be a primary consideration, and understanding the short- and
long-term educational benefits of such programs is central to the current
debate. The discussion here focuses first on those studies that have
examined the effects on students of attendance at preschool in comparison
to students who did not attend preschool. Next, the discussion considers the
results of participation in particular types of preschool curricula. 



PARTICIPATION IN PRESCHOOL PROGRAMS 

The first set of studies we consider addresses the issue of the effects of
participation in preschool programs. An important body of studies,
collectively known as The Consortium for Longitudinal Studies (1982), provides
critical evidence of the long- and short-term effects of early intervention
efforts. The Consortium came about in response to the findings of an
influential Westinghouse (1969) evaluation of Head Start, which concluded
that the program was not having long-term effects on success in school.
Recognizing the need for true longitudinal studies to assess effects of early
intervention, Lazar and Grotberg proposed in 1974 finding well-designed
earlier studies that could be used as a base for follow-up efforts. Fifteen such
studies, authored by twelve developers, were located. The studies differed
in many respects (sample size, age of student, approach to intervention),
but they all had in common the intention to assist the future chances of
low-income children, primarily from minority homes. The Consortium then
examined the long- and short-term benefits of enrollment in these programs.

For our purposes, we are primarily interested in programs whose
effectiveness was determined by an adequate experimental design and which
focused on four-year-olds. We focus on four-year-olds because
most programs being implemented today are specific to children of this age.

From the fifteen projects in the Consortium studies, six projects were
located that focused on four-year-olds (those developed by Beller, Deutsch,
Gray, Karnes, and Weikart). From this list of six, two studies (Gray and
Weikart) randomly assigned students to treatment and control groups.
Following Slavin's (1986) best-evidence synthesis methodology, these two
studies are therefore given the greatest attention here.

Gray (Early Training Project) 

Gray's Early Training Project (Gray, Ramsey, and Klaus, 1982) consisted of
a ten-week summer program in which students met for four hours daily, five
days per week. Students participated in a class of twenty, served by one
teacher and four assistants. During the school year, home visitors also
worked with each family once a week for a period of one hour.

The Early Training Project focused on perceptual/cognitive and language
development using a traditional nursery school format, but with
activities sequenced to become increasingly complex and carefully focused
on increasing language use. The first entry in Table 4-1 summarizes the
major features and effects of Gray's study.

Sixty-one students were randomly assigned to one of three conditions:
two groups that entered the program just described at different ages, and an
untreated control group. An additional control group in another city was also
used. The two treatment conditions differed only in the age at
which they started the program.

The pretest IQ scores for the treatment and control groups were 89.4
and 87.3, respectively. At age five, after participation in the program, the IQ
scores were 96.05 and 86.3. By age 17, the IQs were once again very similar:
78.7 and 76.4.

Gray determined that 2.4 percent of the program children were placed
in special education and 23.8 percent of the control children were so placed.
The program children also showed a greater likelihood of not being retained
in grade. Nearly 56 percent of the program children were retained in grade,
whereas the figure was closer to 69 percent for the control children. About
TABLE 4-1. Effects of Preschool Attendance on Achievement and Other Outcomes

Project: Early Training Project (Gray)

Program/Focus: Intervention program to change attitudes/aptitudes needed
    for school success.

Strategy/Treatments: Morning (4 hrs) summer session for 10 wks and weekly
    1-hr home visits with parent/child to emphasize parental role in
    education of child. Small staff ratio (5:1).

Population: 88 black children in Murfreesboro, TN, aged 3 1/2-4 1/2 in
    1962. Grp A = 61 children in same town; Grp B = 27.

Study Design: Group A (61 students) randomly assigned to T1 (3 yrs
    treatment), T2 (2 yrs), or T3 (no trt). Group B was second control
    group (T4).

Measures/Effects:

    IQ              TRT     CTL     ES
    End of yr       89.4    87.3    [illegible]
    + 1 yr          96.1    86.3    .81
    + 2 yrs         94.9    81.9    1.09
    + 3 yrs         97.7    89.6    .63
    + 4 yrs         93.6    86.1    .51
    + 6 yrs         88.4    81.2    .54

    Spec. ed.        2.4    23.8    .50
    Repeat          47.5    55.6    .16
    Drop out        22.0    43.0    .14
TABLE 4-1. (continued)

Project: Perry Preschool (Berrueta-Clement et al., 1984)

Program/Focus: Examination of effect of preschool on disadvantaged
    children. Longitudinal study followed students up through age 19.

Strategy/Treatments: Curricular approach: developmentally appropriate
    activities stressing role of planning and active learning of child.

Population: 3- to 4-yr-olds from Ypsilanti, MI, with IQs between 60-90,
    not otherwise handicapped.

Study Design: Five waves, 1962-67. Students in each wave randomly
    assigned to experimental or control condition.

    Wave      PS    No PS
    0         13    15
    1          8     9
    2         12    14
    3         13    14
    4         12    13
    Total     58    65

Measures/Effects:

    Cognitive/Scholastic
    Age       PS      No PS    ES
    Pre       79.6    78.6     [illegible]
    5         94.1    83.2     1.01
    6         91.3    86.3     .45
    7         91.7    87.1     .44
    8         88.1    86.9     .11
    10        84.9    84.6     .03
    14        82.3    82.0     .04

    CAT (percent correct)
    Age       PS      No PS    ES
    7         37      32       .11
    8         53      47       .12
    9         46      38       .14
    10        60      54       .12
    11        67      65       .05
    14        36      28       .17

    School Success (percent)
              PS      No PS    ES
    HS grad   67      49       +.72
    Retain    15      13       -.22
    Special   31      45       +.27

    Social/Behavioral (percent)
              PS      No PS    ES
    Arrest    22      41       +.85
    Employ    48      29       .92
Project: NY State Experimental PreK Program (Irvine, 1982)

Program/Focus: Individualized program emphasizing total environment and
    student choice.

Strategy/Treatments: Parent involvement. Health srvs. Social srvs.

Population: Longitudinal preK-3. 5000 disadvantaged in program, selected
    into pgm on low-economic status.

Study Design: Control grp was "waiting-list control group" (eligibles
    without space), and other-district control grp, similar but not
    operating preK. Two waves, 1975, 1976.

Measures/Effects: Direction of effect only (+ or 0), reported for Wave I
    and Wave II against the waiting-list (WL) and other-district (CTL)
    control groups, on the Walker Readiness and Peabody tests (end of
    preK, start and end of kindergarten, grades 1-3, and a longitudinal
    analysis), grade repetition, and special education. The individual
    cell entries are not legible in the source.
22 percent of the program children dropped out of high school prior to
completion; the corresponding figure for control children was about 43
percent. Thus the program had significant effects on the important variables of
grade repetition, special education status, and high school completion. The
effect sizes are in the range of 0.14 to 1.09 (see Table 4-1, far right-hand
column).
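
The effect sizes reported in this chapter can be read as standardized mean differences: the gap between the program and control means expressed in standard deviation units. The sketch below is a minimal illustration under stated assumptions. The chapter does not say which standard deviation was used as the divisor, so the 12-point value is purely illustrative and is not taken from the study, and the function names are hypothetical. The two means are the IQ scores one year after the program as given in Table 4-1.

```python
def effect_size(treatment_mean, control_mean, sd):
    """Standardized mean difference: (treatment - control) / sd."""
    if sd <= 0:
        raise ValueError("sd must be positive")
    return (treatment_mean - control_mean) / sd


def implied_sd(treatment_mean, control_mean, es):
    """Standard deviation implied by a reported effect size."""
    if es == 0:
        raise ValueError("effect size must be nonzero")
    return (treatment_mean - control_mean) / es


# IQ means one year after the program, as listed in Table 4-1 (Gray).
trt, ctl = 96.1, 86.3

# ASSUMPTION (illustrative only): a standard deviation of 12 IQ points.
print(round(effect_size(trt, ctl, sd=12.0), 2))

# Working backward from the tabled effect size of .81 shows which
# standard deviation would reproduce it.
print(round(implied_sd(trt, ctl, es=0.81), 1))
```

Reading the table this way makes the scale concrete: an effect size near 1.0 corresponds to a gap of roughly one standard deviation in IQ, while the values of 0.14 to 0.16 for dropout and grade repetition describe much smaller standardized gaps.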

Concerning the effects on reading and mathematics achievement in
elementary schools, the Gray study provides little reason to expect
continued effects of program participation. The performance of students in the
program was not significantly different from that of control students in math
or reading at grades 4 and 6. Thus the study does not lend support to
long-term effects of this type of early intervention effort on achievement.

When the effects are analyzed separately by sex, a somewhat different
picture emerges. There are pronounced sex differences. In particular, the
Gray program appears to have been beneficial for girls, but not for boys.
Table 4-2 summarizes these results by sex. It is not clear what factors or
processes created these sex differences in the Gray study.

Weikart (Perry Preschool)

The other study included in the longitudinal followup that focused on
four-year-olds and used random assignment in its design is the Perry
Preschool Project (Berrueta-Clement et al., 1984). There are two parts to the
Perry Preschool data that will be discussed. The first considers the effect of
participation in preschool versus no preschool. The second considers the
effect of participation in a particular preschool curriculum.

The sample consisted of 123 disadvantaged, low-IQ children from
Ypsilanti, Michigan. The subjects were recruited by locating all families with
three-year-old children and interviewing the parents to determine their



TABLE 4-2. Comparison of Experimental and Control Groups by Sex in Gray's
Early Training Project

                        Males                    Females
                   Exp     Ctl     ES       Exp     Ctl     ES
Repeat grade       .76     ?       ?        ?       ?       ?
Complete HS        .24     .36     .10      .7?     .20     .80
EMR placement      .09     .36     .41      .00     ?       .6?
WISC-R             79.2    76.8    ?        78.2    ?       ?
Reading            33.1    33.2    .06      40.7    ?7.4    .18
English            31.1    33.9    -.21     ?       ?       .46
Math               22.1    22.?    .02      2?.0    20.6    ?

? = digit or entry illegible in the source.
occupation, education, and household density. Children from low-SES
families were administered an IQ test and those who scored in the 70-85
range were selected. Students were randomly assigned to treatment and
control groups.

There were five waves of this study, beginning in 1962 and continuing
through 1967. Across the five waves, fifty-eight students were assigned to
the preschool condition and sixty-five to the no-preschool condition.

In contrast to the Gray project, the Perry Preschool Project took place
during the academic year as a regular school program. The curricular
approach was based on the developmental theories of Piaget. The
"theoretical framework emphasized the interplay of content (e.g., classification and
spatial relationship), three levels of representation (e.g., index, symbol and
sign), and two levels of operation (motor and verbal)" (Lazar and
Darlington, 1982). The approach emphasized developmentally appropriate activities
and stressed the role of students' planning and initiation in their own
learning.

Children entered the program at age three and attended the program for
two years, mid-October to May. The sessions were half-day, five days per
week. In addition, the teachers visited the homes of the students for ninety
minutes weekly.

The short-term benefit of the Perry Preschool program is evident in the
eleven-point difference in IQ scores between program and control students.
The percent enrolled in special education is also appreciably lower for the
preschool enrollees (31 percent versus 45 percent). The two groups differed
by only 2 percentage points in the percent repeating a grade. The largest
differences were in the percent who graduated from high school, with 67
percent of the experimental students completing high school and only 49
percent of the students in control classes. Similar differences in the percent
employed and arrested were found, favoring the Perry Preschool Program.

The differences in the percent correct on the CAT, taken at ages 7, 8, 9,
10, 11, and 14, favored the Perry Preschool children but were not statistically
significant and ranged from only 2 percent to 8 percent. The second entry in
Table 4-1 summarizes the findings from the Perry Preschool.

Both the Gray and the Perry Preschool programs support the conclusion
that preschool programs can have strong immediate effects on cognitive
functioning, as measured by IQ tests. These effects are apparent at the onset
of treatment and continue several years afterward. In the case of Perry
Preschool, the experimental group is 44 percent of a standard deviation
above the control at age seven. In the case of Gray's study, the effects are
still detectable six years after the initial treatment (effect size = .51).

Both studies also showed similar patterns of effects on reduced referral
to special education and lower rates of dropping out of high school. In the
case of the Perry Preschool Program, the effect size was 0.72 for high school
graduation. In the case of Gray, the effect size was 0.80 for females and 0.10
for males.

Both studies showed minimal long-term effects on achievement as
measured by standardized tests. Gray's study measured achievement at
grades 4 and 6 and showed no significant effect on achievement scores at
either grade. Weikart's study also shows only modest achievement effects
(effect sizes of 0.05 to 0.17).



Head Start Evaluations 

Head Start programs were begun in 1965 as a national effort to "improve
children's intellectual skills, to foster their social and emotional growth and
to help meet their health and nutritional needs." The importance of early
intervention was based on the belief that infancy was an especially critical
time for intervention. It was believed that intervention of a short
duration during this critical time could provide a life-long developmental boost.

Perhaps the most publicized evaluation of Head Start was the
Westinghouse Evaluation (1969), which concluded that Head Start was not
satisfactorily meeting its objectives. However, many methodological and
philosophical criticisms were leveled against the report and its conclusions.

The study used a posttest-only research design, comparing the
achievement of Head Start attendees with a sample of students matched on
age and sex. The Head Start sample included children who attended
summer school only and children who attended full-year Head Start programs.

The posttest-only comparisons indicated no measurable advantage of
Head Start children in summer programs over comparison children. The
full-year program was more effective than the summer program, but the lack
of robust effects led the authors to conclude that "the benefits cannot be
described as satisfactory."

A major criticism of the report was the expectation that success would
be measured as positive effects not only immediately after the program, but
also several years later. Other criticisms of the report cited the insensitivity
of the research design to differences in programs and the nonequivalence of
the control and experimental groups.

More recent and methodologically rigorous evaluations (McKey,
Condelli, Ganson, Barrett, McConkey, and Plantz, 1985) have emphasized that
Head Start was apparently successful in meeting many of its objectives.
Studies using random assignment or matched control groups in the McKey
review provide very strong evidence of the immediate effects of Head Start
on cognitive functioning (average effect size of 0.52), but little evidence for
long-term effects (effect size = 0.10 after the first year, 0.08 after the second
year, and 0.02 after 3+ years). A more positive picture of the long-term effects
emerges if grade retention and special education placement are used as the
criteria. Here, the three studies of the long-term effect indicate median
effect sizes of 0.31 for retention and 0.29 for special education placement
(McKey et al., 1985, p. 19).
New York Experimental Prekindergarten Study

New York State began an experimental prekindergarten program in 1966,
designed to serve children from low-economic-status homes (Irvine, 1982). The
prekindergarten program had four components: classroom experience,
health services, social services, and parent involvement.

The effects of the program were assessed by comparison of the
prekindergarten group with two control groups. One group consisted of
students who were on a waiting list for the program. The other group
consisted of eligible but not participating students.

These groups were compared at the end of the prekindergarten year, at
the beginning and end of kindergarten, and at the end of grades one, two,
three, and six.

The comparisons suggest that the prekindergarten program had
immediate effects and some sustained effects in terms of achievement, grade
retention, and rates of referral to special education. With the available
technical and published reports, it was not possible to compute effect sizes
for all relevant comparisons. Thus Table 4-1 indicates only the direction of
the effect without its magnitude.

These four studies collectively suggest that there is an immediate and
sizeable cognitive effect for participation in preschool that is diminished but
still detectable in the elementary grades. The effect sizes diminish from a
range of around 1.00 standard deviation at the end of the prekindergarten
year, to around 0.20 by the end of the third grade.



EFFECT OF PARTICIPATION IN 
PARTICULAR PROGRAMS 



Curriculum Comparison Studies 

We now consider studies that have examined the effect of participation in
particular preschool curricula. Two types of studies are considered: those
that make direct comparisons among particular curricular models (Table
4-3), and those that provide evidence of effectiveness of a particular model,
in particular JDRP programs (Table 4-4).

The first study contrasted the immediate and long-term effects of three
different preschool curricula. The three models were a language training
approach based on the work of Bereiter and Engelmann (1966), a cognitively
oriented approach based on the Perry Preschool model, and a unit-based
approach which followed a traditional nursery school model. These models
represent a wide range of approaches to preschool education.
The structural features of the programs were identical. The programs



TABLE 4-3. Comparison of the Effectiveness of Particular Curricular Approaches

Project: Perry Preschool CD Project

Program/Focus: Comparison of effects of 3 distinct preschool curricular
    approaches on short- & long-term factors.

Treatment: Half-day preschool sessions and home visits 1 1/2 hr/wk.
    T1 = Academic preschool program (N = 23)
    T2 = Cognitively oriented curriculum (N = 22)
    T3 = Traditional Nursery School (N = 23)

Population: Disadvantaged black 3-yr-olds in Ypsilanti, MI, who scored
    between 62-90 on Stanford Binet test.
    N: 68 Perry; 65 Control group.

Study Design: Four waves of children entered the project from 1967-70.
    Students randomly assigned to one of three treatment conditions and
    then reassigned to balance on IQ. (Control grp was Perry preschool
    grp from wave 0-4 in previous study.)

Effects:

    Cognitive Effects: IQ
                                            ES vs. H/S
    Age     DSTR    H/S     NS      CTL     DSTR     NS
    3       79      77      79      79
    4       107     106     102     83      -.06     -.26
    5       104     97      92      84      +.46     -.35
    6       99      97      93      87      +.13     -.26
    7       97      92      95      87      +.35     -.33
    8       91      92      90      87      .06      -.13
    10      97      92      90      85      +.35     -.13

    School Achievement (CAT + APL), ages 7, 8, and 15, for DSTR, H/S,
    and NS: entries not legible in the source.

    School Success
                    DSTR    H/S     NS
    Repeat Grade    20%     18%     20%

    Social Behavior (Age 15)
                    DSTR    H/S     NS
    Delinquent      12.8    5.4     6.7

Project: Louisville (Miller & Bizzell, 1983)

Program/Focus: To compare the effects of 4 distinct preschool curricular
    approaches. Longitudinal study K-10. Program had demonstrated
    effectiveness and represented different philosophies. Followup
    sample in 9th and 10th grade. Random assignment to groups.

Treatment: All 4 treatments attended preK classes 6.5 hrs/day for one
    academic year.
    T1 = B-E (Bereiter-Engelmann)
    T2 = DARCEE (Gray)
    T3 = Montessori
    T4 = Traditional
    C  = Regular Head Start

    B-E: Academic preschool, carefully structured, highly verbal,
        patterned drill and practice in small groups.
    DARCEE: Gray's pgm focused on lang. development and motivation in
        teacher-directed format.
    MONTESSORI: Emphasis on use of self-corrected materials and
        discovery; focus on relation, development of mind and senses.
        Structured curriculum in which sequence of activities
        controlled.
    TRADITIONAL: Traditional nursery school program focusing on social
        and emotional development.

Population: 14 preK classes in Louisville, KY. Students assigned to
    T1 = 64, T2 = 64, T3 = 33, T4 = 53; N = 214. Students in program
    for 1 yr. Kindergarten yr they were in different program.

Study Design: Random assignment to T/C.

Effects:

    IQ, all students
              T1      T2      T3      T4      C
              BE      DAR     MONT    TRAD    CTL
    PPK       94.7    96.7    90.9    91.8    89.0
    FPK       99.8    96.4    96.8    98.2    90.8
    K         94.7    93.2    93.3    ?       95.0
    1         91.4    93.4    93.8    93.4    93.0
    2         86.6    89.3    92.5    90.8    92.8
    7         82.9    84.8    87.4    ?       ND
    ?         86.7    85.3    88.9    96.6    ND

    IQ, males
              BE      DAR     MONT    TRAD
    PPK       97.1    94.3    89.1    91.3
    FPK       100.5   9?.8    91.7    96.0
    K         91.2    91.8    ?       ?
    1         91.1    94.2    92.8    97.?
    2         90.2    89.4    96.6    94.1
    8         90.0    ?       92.7    88.1
    10        84.?    78.0    91.1    81.9

    IQ, females
              BE      DAR     MONT    TRAD
    PPK       92.9    100.0   92.4    92.1
    FPK       99.2    97.?    101.?   99.4
    K         9?.8    92.2    94.8    91.4
    1         ?       92.2    ?       ?
    2         8?.8    89.1    89.2    88.8
    8         84.1    88.1    8?.7    8?.7
    10        82.1    88.?    77.4    87.1

    ? = digit, entry, or row label illegible in the source.

Project: Karnes (comparison of five approaches)

Program/Focus: Comparative evaluation of 5 preschool programs for
    children from low-income families. Programs varied on extent of
    structure and program emphasis. Differential program effectiveness
    favored T4 and T5 on immediate cognitive effects, but T3 on
    long-term effects. No IQ effects by age 16.

Treatment:
    T1 = Traditional Nursery
    T2 = Community/Integrated
    T3 = Montessori
    T4 = GOAL (Karnes)
    T5 = Direct Verbal (B-E)

    T4 = emphasis on sensory motor with concurrent emphases on exp. lang.
    T5 = direct instruction in preskills.
    Low parental involvement in these projects.
    Half-day program for 1 yr.
    Second year: T1, T2, T3 -> regular K
                 T4 -> regular K + 1 hr
                 T5 -> intensive lang., not regular K.

Population: Low-income families from Champaign/Urbana. 123 students in
    original sample. 45 in post hoc comparison grp. Aged 4 at entry.
    Duration 1 yr and some treatment in second yr.

Study Design: Random assignment to treatment, no control grp at the
    time, post hoc group constructed later. 15-yr study.

Effects (row labels not legible in the source):

              T1      T2      T3      T4      T5
              94.5    93.3    94.1    96.0    93.5
              102.6   98.4    99.6    110.3   109.2

    T4 and T5 > T1, T2, T3 (.39)

    Comparison with post hoc control group (percent):
    Referred spec. ed.    21.6    33.3
    Retained              12.9    35.7

TABLE 4-4. Description of Evaluated Preschool Programs

Home-Based Programs

Name: Verbal Interaction Project (Levenstein)
    Program Description: Home-based early intervention program for ages
        2-4 based on view that conceptual growth built on language of
        mother/child in home.
    Staffing/Components: Toy Demonstrator (high school ed.). Half-hour
        home sessions twice/week for 7 months over two years.
    Study Design: Comparison with randomly assigned control group.

Name: Compensatory Early Education Home-Based Program
    Program Description: Identification of students who are
        significantly deficient in language development, gross and fine
        motor and readiness skills. Treatment is to provide
        instructional activities, instruct parents in home in 1 1/2
        hr/wk training session.
    Study Design: Pre-post gains over 10-month period.

Name: Parent Child Early Education Program (PCEE), Ferguson, MO
    Program Description: Primary focus on early identification and
        treatment of educationally disadvantaged child; focus on
        improving parent competencies.
        IDENTIFICATION: Comprehensive diagnosis includes initial
        evaluation by parents and teachers and diagnosis by staff
        specialist as needed.
        TREATMENT: Home teaching visits to provide individualized
        instruction. Saturday school-home activity guides.
    Staffing/Components: Saturday school-parent home visits/teaching
        carried out by specialist in learning disabilities.
    Study Design: Comparison of preK participants with control with no
        preK at entry to kindergarten. No evidence of initial
        equivalence of groups.

Name: Portage Project, Portage, WI
    Program Description: Home teaching program for multicategorically
        handicapped birth to 6 yrs and for nonhandicapped with
        developmental delays. Individualized curriculum, parents as
        teachers, weekly data collection on progress in home.
    Staffing/Components: Individual assessment. Individual planning.
        Home teaching/visits monitoring progress.
    Study Design: Comparison of Portage Project with local preschool
        program for disadvantaged. Original certification by JDRP on
        basis of pre-post growth scores of 1.3 to 2.3 months in areas
        of physical, cognitive, language,

(continued)
TABLE 4-4. (continued)

Center-Based Programs

Name: Child Parent Centers (CPC)
    Program Description: Individualized, highly structured half-day
        program, parent participation emphasized, general readiness
        objectives.
    Staffing/Components: Support services by nurse, social worker,
        speech therapist, and curriculum specialist. 17:1 staff.
    Study Design: Comparison with control group of children living in
        comparable conditions.

Name: Seton Hall
    Program Description: Parent-child visit school weekly 2-hr session
        for structured learning activities/parent discussion group,
        home activities to take away.
    Staffing/Components: Parent educator.
    Study Design: Pre-post design.

Diagnostic/Prescriptive/Supplemental

Name: Reading Improvement and Preschool
    Program Description: Students are selected to participate on basis
        of developmental delays, using Denver Developmental Screening
        Test. Program consists of balance of activities taken from
        guidebook. Children attend 3 hrs/day 4 day/wk for 160 days/yr.
        Training session with parents on fifth day.
    Staffing/Components: One teacher, one aide and volunteers to make
        ratio 5:1.
    Study Design: Pre-post NE gain.

Name: Communication Program
    Program Description: Supplemental 30-min period/week in which
        teaching focuses primarily on communication.
    Staffing/Components: Speech/language clinician works with classroom
        teacher.
    Study Design: Pre-post design monthly gain.

General Academic Programs

Name: COPE (Cognitively Oriented Pre-Primary Experience)
    Program Description: General and academic curriculum for preschool
        children, focuses on provisions of cognitively appropriate
        curriculum.
    Study Design: Pre-post design monthly gain.

Name: Cognitively Oriented Preschool Curriculum
    Program Description: Open framework curriculum based on Piaget.
        Plan-do-review sequence.
    Staffing/Components: Teaching team: two adults/classroom.
    Study Design: Pre-post design.
met for half days and had a low adult/child ratio. Home visits of 1 1/2 hours
were made every two weeks with an emphasis on training the parents as
teachers in a method consistent with the model.

Students were randomly assigned to treatment groups. Eligible
students lived within the Ypsilanti public school attendance area, were from
low-SES homes, were three years old, and were below average in IQ (the
IQ range was 62-90).

Four waves of children participated in the CD project from 1967-1970. 
These waves were called waves 5 through 8, following waves 0 through 4 in 
the original study. 

Wave 5 (n = 27) contained children who had been in the Perry
Preschool program the previous year either as controls or as program
participants, as well as new enrollees. Control group children went to the
language program, new enrollees to the unit-based program, and Perry Preschool
program participants continued in the cognitively oriented group. In the
original report of the CD project, these twenty-seven children were not included: "Since
children in Wave 5 experienced different educational programs as three year
olds and were not enrolled for two consecutive years in one of the three CD
project programs, they are not included in the longitudinal sample."
However, we note that these students were included in later reports and
analyses.

Siblings were assigned to the same programs. Some reassignment of
children was necessary to maintain equivalence along IQ lines. Of the
various background features, the only one that was significantly different
across the three groups was "years of mother's education" (the language
group had 9.7, the cognitively oriented curriculum group had 9.3, and the
nursery school had 10.9).

Cognitive measures of the three different preschool groups were taken
at ages three through eight and ten. The most striking pattern is the large
jump in IQ for all three groups after entry into the program and a steady
decline after the initial jump. However, many factors influence the validity
of these results. First, the predictive validity of IQ scores before age five is
minimal (McCall, Appelbaum, and Hogarty, 1973). Second, students were
selected on the basis of low IQ scores, ensuring that the means would tend to
rise over time due to statistical regression. These difficulties and the fact that
the means are based on a small number of cases suggest a great deal of
caution in interpreting these results. We do note that, in most cases, the
mean IQ for students in the language group exceeded that of the cognitively
oriented group, by as much as nearly half of a standard deviation (at age five)
and by a third of a standard deviation (ages seven and ten). These results are
not statistically significant given the small sample size, but are suggestive
that the language program did have more positive impact on cognitive
measures than did the cognitive curriculum.
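
The point about statistical regression can be illustrated with a small simulation. The sketch below is hypothetical throughout: it assumes a population IQ mean of 100 and standard deviation of 15, an illustrative test reliability of 0.8, and a sample size chosen only to make the averages stable. None of these values comes from the study. Only the 62-90 selection range is taken from the text. No treatment is applied to the simulated children, so any rise in the selected group's mean on retest comes from measurement error alone.

```python
import random
import statistics


def simulate_regression(n=20000, mean=100.0, sd=15.0, reliability=0.8,
                        low=62.0, high=90.0, seed=0):
    """Select children whose first score falls in [low, high]; retest them.

    Returns (mean first score, mean retest score) for the selected group.
    All parameter defaults except the 62-90 range are illustrative
    assumptions, not values from the study.
    """
    rng = random.Random(seed)
    true_sd = (reliability ** 0.5) * sd           # spread of stable ability
    error_sd = ((1.0 - reliability) ** 0.5) * sd  # spread of test noise
    first, second = [], []
    for _ in range(n):
        true_score = rng.gauss(mean, true_sd)
        test1 = true_score + rng.gauss(0.0, error_sd)
        if low <= test1 <= high:
            # Retest the same child with fresh, independent noise.
            test2 = true_score + rng.gauss(0.0, error_sd)
            first.append(test1)
            second.append(test2)
    return statistics.mean(first), statistics.mean(second)


pre, post = simulate_regression()
print(f"selected group, first test: {pre:.1f}")
print(f"selected group, retest:     {post:.1f}")
```

Under these assumptions the retest mean of the selected group sits a few points above its first-test mean even though nothing was done to the children, which is why gains in a group chosen for low scores cannot be attributed to a program without a comparison group selected the same way.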

The recent major finding from this study, which has received a great 
deal of attention, is the claim that there is a connection between type of 
preschool experience and delinquency at age fifteen. Schweinhart and
Weikart (1986) followed fifty-four of the original sixty-eight children in the
study at age fifteen and obtained self-reports of delinquent acts for these
groups. They found that the cognitively oriented and nursery groups
reported engaging in half as many delinquent acts as the direct instruction
group: 5 for the High/Scope, 7 for the nursery school, and 13 for the direct
instruction group. From this finding, the authors (Schweinhart, Weikart, and
Larner, 1986) have suggested that enrollment in academic preschool
programs is linked to delinquency. In particular, the lack of autonomy and
self-direction in direct instruction models is suggested to be related to the
emergence of different rates of juvenile delinquency.

How credible are these claims of different effects by preschool
program type? Bereiter (1986) and Gersten (1986) point out several
difficulties with the study design and methodology.

First, sample attrition from the original sample to the
fifteen-year-old sample resulted in an uneven distribution of
females across the three groups. In the original sample (n = 68), the female
proportions were 59, 52, and 59 percent for direct instruction, cognitively
oriented curriculum, and nursery school; in the age-fifteen sample, the
respective proportions were 44, 61, and 55 percent. These differences,
although not statistically significant, are important because
self-reported incidents of juvenile delinquency vary by sex, with males
reporting far more, and more serious, delinquent behaviors than females
(Gottfredson, 1987). As Bereiter (1986) points out, this difference in the sex
composition of the sample could account for the difference between the
treatments. Schweinhart and Weikart, however, replied that the
"delinquency scale showed no gender bias, with study males reporting an
average of 9 offenses and study females an average of 8." They go on to note
that direct instruction males averaged twelve offenses whereas females
averaged fourteen offenses. Unfortunately, separate results by sex are not
presented, so the role that differential attrition by sex may have played
cannot be assessed, although the authors report that a series of two-way
analyses of variance that controlled for sex, race, and mother's education
found the same pattern of results.

Second, some of the children in the nursery and direct instruction
models had only one year of preschool experience, but all the cognitively
oriented children were enrolled for two years. Again, it is not clear to what
extent these differences in duration are of consequence.

A third concern is that observational data collected at the time of the
study (Seifert, 1969) do not suggest that the operating environments in the
classrooms were in fact all that different in terms of the major intervening
theoretical variable: opportunities for self-direction and child-initiated
activities.

Finally, the conclusions of this study seem to be at odds with the
conclusions of other longitudinal studies of the effects of different curricula.
Two other longitudinal studies that have made curricular comparisons
similar to the ones made by the Curriculum Demonstration project (Karnes,
Shwedel, and Williams, 1983; Miller and Bizzell, 1983) do not find
differences by curricula. The results of these two studies are summarized in
Table 4-3. The Karnes studies contrasted five treatments: traditional
nursery school, community/integrated, Montessori, GOAL (Karnes), and
direct verbal (Bereiter-Engelman). The authors conclude that no one method
demonstrated superiority over the others. They also conclude that the
effects of preschool fade out after about the middle of the elementary years.
It does appear that the initial scores for students in the GOAL and the direct
verbal treatments were greater than those in the other treatments, although
these results do not appear to be long lasting.

Miller and Bizzell (1983) found an interesting pattern of treatment by sex
interaction effects in their comparison of the short- and long-term effects of
participation in four distinct preschool curricula. The four treatments were
Bereiter-Engelman, Gray, Montessori, and a traditional nursery school.
They found minimal evidence of effects until the results were analyzed
separately by sex. The Montessori males had achievement and IQ scores in
the range of 0.25 to 0.50 of a standard deviation higher than the other
treatments. The Montessori males also had higher achievement at
kindergarten and through the eighth grade.
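
Differences such as "0.25 to 0.50 of a standard deviation" are standardized mean differences. As a minimal sketch of how such a figure can be computed, the example below divides the difference in group means by a pooled standard deviation. The scores are hypothetical and invented for illustration, and the reviewed studies may have computed their effect sizes with a different denominator (for example, the control group's standard deviation).

```python
import math
import statistics


def standardized_mean_difference(treatment, control):
    """Difference in group means divided by the pooled standard deviation."""
    n_t, n_c = len(treatment), len(control)
    var_t = statistics.variance(treatment)  # sample variance (n - 1 denominator)
    var_c = statistics.variance(control)
    pooled_sd = math.sqrt(
        ((n_t - 1) * var_t + (n_c - 1) * var_c) / (n_t + n_c - 2)
    )
    return (statistics.mean(treatment) - statistics.mean(control)) / pooled_sd


# Hypothetical scores, invented for illustration only.
group_a = [104, 98, 110, 101, 95, 107, 99, 102]
group_b = [100, 93, 104, 97, 91, 103, 96, 98]

effect = standardized_mean_difference(group_a, group_b)
print(f"effect size: {effect:.2f} standard deviations")
```

Expressing a difference in standard deviation units is what allows results from different tests and samples to be placed on a common scale, although with samples as small as those discussed here such estimates are very imprecise.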

Taken together, the three studies do not present a consistent picture of
the greater effectiveness of a particular preschool model. On the basis of
these data, it would be difficult to justify or condemn a particular approach.
This suggests that many competing programs may be worthwhile and not
injurious to children and that other considerations may therefore be more
important in deciding how to organize and deliver prekindergarten
instruction. This finding is also consistent with evidence for the kindergarten
year (see Chapter 5 of this book).



PARTICIPATION IN 
PARTICULAR PROGRAMS 



JDRP Studies 

In addition to these three studies that contrast specific programs,
evidence of program effectiveness is provided by evaluations submitted to the
JDRP, from the Compensatory Education Source Book, and from other
published literature. We wrote to the developers of the programs listed by
the JDRP and by the Compensatory Education Source Book for details of
program design, operation, and evaluation. Many of the evaluations provide
only a limited description of the evaluation or did not use a strong evaluation
design. Virtually none of the studies in the two compilations used random
assignment to treatment or control or used a matched control group.
Therefore, many of these studies should be viewed as illustrative of possibly
effective strategies and as candidates for a more thorough evaluation.

The studies listed in Table 4-3 are illustrative of the programs found in
these sourcebooks. They cover a wide range of approaches to early
childhood education. There are parent involvement programs centered in
the home (Levenstein and Compensatory Early Education Home Based
Program) and in the school (Child Parent Centers and Seton Hall), diagnostic
prescriptive programs (Reading Improvement and Preschool, Early
Prevention of School Failure), supplemental programs (Communication
Program), general curricular models (COPE and Cognitively Oriented
Curriculum), and language enrichment (Peabody Language Development Kit
evaluations).

We list these studies to give a flavor of the variety of approaches and to
make a plea for evaluations with adequate control groups so that
effectiveness might be better judged. In the studies listed here, which are
typical of those listed by the JDRP and the other sourcebook, effectiveness
was judged primarily by comparing pre-post gains with expected pre-post
score gains. But such an approach does not control for the operation of other
factors that influence growth as well. The two exceptions to this are the
evaluations by Levenstein and the evaluations of the Peabody Language
Development Kit (PLDK). In Levenstein's case (1970), individuals were
randomly assigned to a treatment and control group for comparisons on
maternal interactive behavior (the major intervening variable) and to a
matched control group on cognitive and school success measures. In the
case of the PLDK, although the findings of various meta-analyses have been
disputed, the original evaluation of the pilot materials (Dunn, Horton, and
Smith, 1981) does provide evidence of its effectiveness.



SUMMARY AND CONCLUSION 

This chapter has examined the short- and long-term effects of participation 
in preschool programs for disadvantaged students. Two types of studies 
were examined — those that contrasted the effect of preschool versus no 
preschool and those that examined the effect of participation in particular 
types of preschool programs. 

These studies suggest that there are short-term and long-term benefits
for children from participation in preschool programs. The benefits are most
pronounced immediately after participation and exhibit the familiar
wash-out effect as students progress through elementary school.

The examination of program effectiveness does not suggest the
superiority of one particular approach. However, this conclusion does not
imply that schools can therefore open up their doors to four-year-olds and
automatically expect the same types of effects as demonstrated by the
curriculum studies. The curriculum studies all used well-conceptualized,
highly integrated, very structured, and coherent approaches to preschool
education. They were quality versions of a particular philosophical
approach. We actually know very little about the effectiveness of programs as
they are currently being implemented, which often fall far short of the
quality programs discussed here. We do not know, for example, how far
programs can deviate from the conditions in these studies in terms of class
size, teacher training, and physical environment before the effects
disappear. It is important to learn if programs, as implemented, are indeed as
effective as the research on which they are based indicates they might be.

The knowledge base on which we can form decisions about early
childhood programs is limited in four important regards. First, we are
lacking empirical studies of the relative costs and merits of alternative ways
of combining preschool and other services for young children. There are
pressing child care needs for young children that are only partially addressed
by the availability of preschool. We need studies of the costs and the effects
of alternative arrangements to meet the child care, educational, and
emotional needs of young children and their families. What workable
combinations of day care, before and after school care, and preschool
education might there be? Under what conditions is preschool a good
investment? For everyone? In what type of program? Would home-based
models, which are considerably cheaper to operate, have the same type of
effectiveness?

Second, we lack sufficient empirical studies of the effects of major 
approaches to preschool curricula. The review conducted here of this issue 
was based on three studies! As important a topic as the curriculum of the 
preschool is certainly deserving of more intense research scrutiny. 

A third limitation is the nature of the methodology used in many of the
evaluations of program effectiveness. By and large these evaluations based
their judgment of effectiveness on pre-post gain scores. Of course, this
procedure does not control for other factors, besides the program, that may
have contributed to the gain. More rigorous evaluations are needed to
inform the question of the usefulness of most of the curricula used today.
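
The weakness of pre-post gain scores can be shown with simple arithmetic. In the sketch below, which uses hypothetical scores invented for illustration, the program group's raw gain looks large, but most of it is matched by the gain of a comparison group that did not receive the program. The function names are our own and do not correspond to any procedure used in the evaluations reviewed.

```python
def naive_gain(pre, post):
    """Mean post-test score minus mean pre-test score for one group."""
    return sum(post) / len(post) - sum(pre) / len(pre)


def gain_relative_to_control(prog_pre, prog_post, ctl_pre, ctl_post):
    """Program-group gain minus the gain of a comparison group."""
    return naive_gain(prog_pre, prog_post) - naive_gain(ctl_pre, ctl_post)


# Hypothetical scores, invented for illustration only.
program_pre, program_post = [40, 44, 38, 42], [52, 55, 49, 54]
control_pre, control_post = [41, 43, 39, 41], [50, 52, 47, 51]

# Raw gain of the program group (maturation and other factors included).
print(naive_gain(program_pre, program_post))
# Gain remaining after subtracting the comparison group's gain.
print(gain_relative_to_control(program_pre, program_post,
                               control_pre, control_post))
```

The subtraction is only as trustworthy as the comparison group is equivalent, which is why random assignment or careful matching matters so much in the evaluations discussed above.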

A fourth limitation in the available studies is that they often fail to
consider how preschool is connected to the kindergarten and elementary
grades. Given that resources for education are constrained, it is important to
assess whether placing resources into preschool is more effective than
adding these same resources into kindergarten or elementary grades. Would
a student benefit more from attending a preschool program or from attending
an all-day kindergarten and tutorial program in the first grade? Simply
distributing a small amount of resources around to provide more services is
probably not going to benefit very many children, at risk or otherwise. The
critical question is how the demand for preschool can be balanced against
the urgent need for so many other educational services for at-risk students.
To address this issue fairly, we need more than rhetoric or a handful of
studies. We need to carry out experimental contrasts of interesting
combinations of services to provide an empirical base for future decisions
among educational alternatives for young children.


Kindergarten attendance is nearly universal in the United States today.
About 93 percent of all five-year-olds are presently enrolled in school,
primarily in kindergarten programs. However, the kindergarten experience
itself is far from uniform. Kindergartens may be operated by public or
private schools, may be academic or developmental in focus, and may be in
session for a full day every day, a half day every day, or, more rarely, for a
full day every other day. As the first introduction to the formal apparatus of
schooling for many children, kindergarten is an important experience, but
one that clearly takes on different meaning for different children.

It is especially important to understand how these different
kindergarten experiences affect students at risk of future academic difficulty.
Do these students enter first grade adequately prepared to succeed in
elementary school or do they enter already behind and on their way to
failure? What alternatives are there for students who are not successful in the
kindergarten year? And finally, what arrangements of kindergarten seem
most likely to increase the chance of academic success for these students?

The purpose of this chapter is to describe effective kindergarten
programs and practices for students at risk of future academic failure.
Program is defined as a set of procedures intended to be implemented as a
total package (including curriculum materials, inservice training, and strong
evidence of effectiveness) and capable of being replicated by others. In
order to be included in this review, evaluations had to present convincing
evidence of effectiveness based on rigorous methodology. Alterable features
of kindergarten, such as class size, length of day, and staffing patterns, are
also highlighted.

What students are at risk of later failure? Is there a single diagnostic
procedure or a series of procedures to identify such students? The definition
and assessment of at risk is both a political and methodological issue that
cannot be addressed in detail here. Instead, we leave the definition of at risk
purposely vague. Students may be at risk because they enter school with
specific auditory, visual, or other developmental lags. Students may be at
risk because the linguistic opportunities in their daily lives are very limited
and they lack the necessary background and experience in receptive and
expressive language. Students may be at risk because they lack the
necessary social/emotional skills to function semi-independently in a group
setting such as a kindergarten classroom. Lastly, students may be at risk
because the school program is inappropriate for them either in its approach
or in its difficulty level.

The kindergarten year is pivotal for students who may encounter later
academic difficulties: it provides the basis for their success in the
elementary curriculum that follows. Once viewed primarily as a year of
transition and outside the realm of the elementary program, today the
kindergarten year is primarily viewed as an academic/preparatory year (ERS,
1986) with clear connections to the elementary curriculum. Most of the
programs in the public schools are focused either directly on academics (22
percent) or on academic preparation (63 percent).

These changes in the focus of kindergarten have gone along with
changes in enrollment patterns and governance structure of kindergartens.
Kindergarten enrollment has soared from about 5 percent of five-year-olds
in 1901 to the present 93 percent. At the same time, more and more of the
kindergarten programs (84 percent) are provided by public schools. There is
also increasing activity by states to make the kindergarten year mandatory.
Finally, kindergarten programs, which started as full-day programs but were
reduced to half-day programs during the baby boom era, are moving to
full-day programs again.

These changes in enrollment, provision, and length of the kindergarten
day are occurring concurrently with great pressures to increase the
academic standards of schools, and a renewed optimism about the efficacy of
early programs for disadvantaged youths. Also, the movement against social
promotions has had a general effect on escalation of the curriculum for those
who are promoted (Shephard and Smith, 1985), and this has produced
increasing demands for accountability for the performance of kindergarten
students. In the past, when kindergartens were mainly in private schools,
were attended by only some students, and were mainly focused on
socialization and adjustment, questions of program effectiveness and
accountability were of little interest.

Today, kindergarten teachers need to send the first grade teacher
"prepared students." If children need to be ready to read in the first grade,
then the kindergarten is held responsible for that preparation. If children
need to be ready to add and subtract in first grade, then the kindergarten
needs to teach them the necessary prerequisite number skills and
concepts.

Some commentators are concerned that the push to early academics is
harmful rather than beneficial to children (Elkind, 1986). The stress created
by the demands of the formal learning situation, rather than benefitting
students, may well place them at risk of future academic failure. Despite
these concerns, the pressures for an academic kindergarten continue. In a
recent survey, 61 percent of public school principals and about the same
number of kindergarten teachers said the primary focus of their kindergarten
program was "academic and social preparation" for first grade. About 22
percent of the principals said the primary focus was on academics. The
kindergartens in urban areas were most likely to focus on academics.



PREPARATION FOR FIRST GRADE 

If the major task of the kindergarten is to get students ready to read and
compute, we need to know what readiness in these areas means. Many
school districts are specific about the intended outcomes of the kindergarten
year. For example, Figure 5-1 shows the twenty-three objectives given by
one school district. The kindergarten report card for this district indicates
that kindergarten students are formally evaluated on these stated objectives.

FIGURE 5-1. Expected Student Outcomes: Kindergarten

 1. Recognize and print name
 2. Name colors and letters
 3. Distinguish beginning sounds
 4. Tell a picture story in sequence
 5. Name six shapes
 6. Name and count objects 0-10
 7. Sequence numerals 0-10
 8. Match numerals with objects 0-10
 9. Color within boundaries
10. Know personal data
11. Fasten and tie shoes and coat
12. Use scissors with ease
13. Express ideas and take part in group discussion
14. Listen attentively
15. Recognize likenesses and differences
16. Practice self-control
17. Work and play cooperatively
18. Follow directions
19. Complete projects promptly
20. Obey safety rules
21. Practice good health habits
22. Work independently
23. Participate in organized activities

Another way to examine the academic requirements for kindergarten is
to look at the typical readiness tests given children in the kindergarten year.
For example, consider the Metropolitan Readiness Test, a widely used test
whose validation centered on an analysis of the beginning reading process.
Level I (early kindergarten) and Level II (late kindergarten) assess the
following areas:

Level I                              Level II

Auditory Skill Area                  Auditory Skill Area
1. Auditory Memory                   1. Beginning Consonants
2. Rhyming                           2. Sound-Letter Correspondence

Visual Skill Area                    Visual Skill Area
3. Letter Recognition                3. Visual Matching
4. Visual Matching                   4. Finding Patterns

Language Skill Area                  Language Skill Area
5. School Language and Listening     5. School Language
6. Quantitative Language             6. Listening

                                     Quantitative Skill Area
                                     7. Quantitative Concepts
                                     8. Quantitative Operations

Some of the skills, such as auditory memory and rhyming, may not
seem directly related to reading. But learning to read requires calling upon a
complex combination of visual, auditory, and kinesthetic skills. Visual
perception is required in order to differentiate letters, such as n
versus m and b versus d. Auditory discrimination of similar sounds, such as t
versus d, is needed to link the visual to the language known orally. The
sound-sight correspondence of letters (phonemes) must also be mastered.
The child needs to understand the concept of a word and syllable and how
blending of phonemes creates words. Short-term memory is important:
children need to be able to recall accurately the syllables they have blended,
such as an-i-mal, not am-i-nal, a common occurrence. Thus the
kindergarten goal of preparing children for reading and arithmetic instruction
in the first grade involves activities and tasks that may have little obvious
resemblance to first-grade activities. Prerequisite skills are not necessarily
the same skills in smaller dosages or of less difficulty. Mastering prerequisite
skills does not mean working on smaller ditto sheets. Instead, readiness for
reading and math involves conquering many visual, auditory, and fine and
gross motor skills that are necessary in the process of reading, but may not
seem to be obviously connected to reading.

KINDERGARTEN SCHEDULE AND 
ACTIVITIES 

In the recent ERS (1986) study of public kindergartens, about three-fifths
of all teachers stated that they followed "definite time allotments and
sequence for each activity." A fairly typical half-day kindergarten schedule
might be:

 8:30-8:35     Arrival/Get Together
 8:35-9:25     Reading
 9:25-9:45     Exploration (free time)
 9:45-10:10    Math
10:10-10:25    Movement/Music
10:25-10:55    Social Living/Art
10:55-11:00    Dismissal

Reading or reading readiness instruction is typically based on a
commercial reading readiness series (75 percent of all teachers in the ERS
survey said they used the readiness series), and the teacher typically
provides formal structured instruction during reading and math periods.

Objectives for the kindergarten year may be explicitly stated. For
instance, detailed objectives for minimum, average, and above-average
students may be formulated for comprehension skills (e.g., noting details,
main idea, sequence, drawing conclusions) and vocabulary (e.g., phonetic
analyses, short vowel, word meaning, final consonant, and consonant
blends). As an example, the kindergarten objective for the Baltimore City
Public Schools covering phonetic analyses of initial consonant sounds states
that "given a picture of an object or an action and several words, one with
the same beginning sound as the pictured object, the student will select the
word that begins the same as the pictured object or action."



WHEN STUDENTS ARE UNSUCCESSFUL

One of the major problems facing the transition from kindergarten to first
grade is how to assess readiness for first grade and what to do with and for
students who are deemed not ready to go on to first grade. There are varied
practices for assessing student readiness for first grade work. These include
teacher recommendation and judgment, results of standardized screening
and assessment devices, and evaluations by specialized personnel. For
example, the state of Georgia just passed legislation specifying performance
on the CAT as a basis of promotion from kindergarten (Education Week,
February, 1988). Bases for nonpromotion include student immaturity, low
attention span, small size for kindergarten, inability to sit still, and retarded
large/fine motor or language development. Concerning who is most likely to
experience difficulty in kindergarten, we know that males far outnumber
females, and that low-SES and minority students also outnumber their
advantaged peers. The specific reasons for these referrals are quite different,
however. For example, the males may be retained more often because of
immaturity, whereas low-SES or disadvantaged students are more often
retained because of language or other developmental lags.

Actions taken as a result of the failure to thrive in kindergarten
generally fall into three categories: repetition of the kindergarten program,
additional time within the kindergarten year, and alternative programs. By
far, the most common practice is repetition of the kindergarten year, either
in the guise of pre-first, transition, or junior first-grade classes. This
approach is based on the belief that children who fail to prosper in the
kindergarten year are simply "young" for their age, and by letting them
mature, they will be able to perform adequately and even blossom.
However, the evidence on student grade repetition (discussed later) offers
no support for this view.

A second approach has been to provide more time within the same year
for kindergarten students, usually by extending the kindergarten day. There
are several variations on this approach. One is to screen children and give
only some students additional remedial or enrichment instruction. Another
way is to add time for an entire school system that has a high percentage of
at-risk students. Finally, many school systems that do not have a high
percentage of at-risk students are extending the school day for kindergarten
students anyway. We discuss the effectiveness of the increased time
approach in the next section (see Karweit, 1987).

The third approach involves screening and assessment of children for
learning problems prior to entry to kindergarten and the delivery of a
specialized curriculum to suit their needs. This approach differs from the
other two in its assumptions about learners and the role of the school and the
personnel needed. It assumes that children learn in different ways and
through different modalities and styles and that intervention strategies are
needed to address these distinct areas of strength and weakness. This is not
just individualizing instruction according to the level of difficulty of the
material or rate of learning, but according to the learning avenues best suited
for a particular child (e.g., visual, auditory, kinesthetic).

We will now examine the effects of these approaches. 



PROMOTIONAL PRACTICES 

One of the outgrowths of the 1983 reform movement has been a renewed
focus on standards and a renewed interest in nonpromotion as a way to
achieve these standards. However, nonpromotion has not been supported as
an effective policy (Holmes and Matthews, 1984; Jacket, 1975; Niklason,
1984; Shephard and Smith, 1985). Gredler (1984), after examining the effects
of transition rooms for students deemed unready for first grade, concludes:

Analysis of the research studies of transition rooms raises questions about the
degree of educational "payoff" obtained with such programs. Research
indicates that transition room children either do not perform as well or at most
are equal in achievement levels to transition room eligible children placed in
regular classrooms (p. 469).

Research findings notwithstanding, schools continue to retain students as a
remediation strategy, especially at the early grades. Part of the reason for
continuation of the practice may be that schools cannot locate other
alternatives. Also, teachers may view the practice as effective: retained
students do make some gain during the retained year, and teachers are
unable to compare this gain during the retained year to gains the students
would have made had they been promoted.

Shephard and Smith (1985) took advantage of the existing variation in
school kindergarten retention rates to address this issue. They noted that
many previous studies were flawed methodologically because the
comparisons were not of equivalent students under different policies, so they
sampled same-sex students with similar birthdates, family backgrounds, and
entering test score data from schools with contrasting retention rates. They
compared retained students with nonretained students. The results were
striking. Students who had spent an additional year in kindergarten were
basically identical to those control students who had been promoted. The
only notable difference was that the repeaters scored one month higher (a
grade equivalent of 1.9 versus 1.8) on the CTBS reading comprehension test
taken at the end of the first grade. A one-month gain for one additional year
of schooling does not seem like a very economical practice.*



EXTENDING THE KINDERGARTEN DAY 

Karweit (1987) examines the effects of full-day versus half-day kindergarten
in detail. Table 5-1 summarizes the individual studies reviewed in that
paper. Figure 5-2 provides an indication of the direction of effects by the
adequacy of the study design and the population served.

Figure 5-2 suggests where the effects of all-day kindergarten programs
are located. Disadvantaged students who receive additional instruction are
the primary source of the positive effects. Nine studies focused on the effect
of full-day kindergarten for underachieving and disadvantaged students. Of
the two strongest studies (using random assignment), one showed significant
effects for the full-day kindergarten treatment. The other seven studies fell



* One may take issue wiih the equivalence of these groups. Many referrals to repeat 
kindergarten arc based on behavioral indicators that were not equated. Also children 
grow at very different rates during the kindergarten year. The entering test score may 
not be a very good proxy for where the students were at the end of the year when the 
assessment for promotion was made. Finally, there may be school level differences 
that drive the differences in retention rates whicl, may also affect performance of the 
students. Policies such as providing students special help in the summer or 
after-school tutors, rather than retaining them, might have given the nonrctainccs 
additional resources as well. 

TABLE 5-1. Summary of Effects of Full-Day versus Half-Day Kindergarten Programs 

Random Assignment/Matched Control Group Studies 

Johnson (1974) 
  Design: Pre-post. Random assignment. Replicated. Longitudinal effects. 
  Sample: Princess Anne, MD. 20 students matched on age, race, SES, sex, and ability assigned to TRT (full-day) or CTL (half-day). 3 experiments: 1970, 1971, & 1972. 
  Treatment: EXP=full-day (5'15"); CTL=half-day (2'30"). Same curriculum. Enrichment given full-day. Measures: Walker Readiness; Stanford Achievement; reading group, grade 1. 
  Effects (effect size): 
    Walker: Cohort 1, .66 p<.05; Cohort 2, .58 ns; Cohort 3, .08 ns 
    Stanford: Cohort 1, .13 ns; Cohort 2, .28 ns; Cohort 3, .59 ns 
    Reading group: Cohort 1, .00 ns; Cohort 2, .06 ns; Cohort 3, .57 ns 

Winter and Klein (1970) 
  Design: Screened; then random assignment to treatment/control. Two studies. 
  Study 1) Disadvantaged 
    Sample: Treatment and control selected from lowest HY/r of class. TRT: n=6; CTL: n=7. 
    Treatment: CTL=attendance am/pm; TRT=regular + 90 minutes academic pgm. No pretest difference. 
    Effects (effect size): 
      Metropolitan at end of K: +3.01 p<.005 
      Stanford at end of K: .62 ns 
      Stanford at end of 1st: .62 ns 
  Study 2) Advantaged 
    Sample: Selected trt & control from those most able to benefit. TRT: n=26; CTL: n=29. 
    Treatment: CTL=attendance regular; TRT=regular + 90 minutes academic pgm. Significant pretest differences favoring TRT. 
    Effects (effect size): 
      Pretest Peabody Picture: 1.28 p<.05 
      Metropolitan at end of K (adj post): ns 
      Stanford at end of K (adj post): ns 
      Stanford at end of 1st (adj post): 1.03 p<.05 

Oliver (1980) 
  Design: Pre-post ANCOVA. No pretest differences. Comparable program. 
  Sample: Cambridge, MA. 61 students in 4 classes half-day. 98 students in 6 classes full-day. 
  Treatment: EXP=full-day with structured curric., 117 minutes/day; CTL=half-day with same structured curriculum, 83.8 minutes/day. 
  Effects (effect size): 
    Clymer-Barrett Prereading Inventory: 2.84 p<.05 
    Murphy-Durrell Prereading: 1.16 p<.05 
  Notes: Effect size inflated by use of class means. 

Nonmatched Groups/Pre-Post Studies 

Carapella and Loveridge (1978) 
  Design: ANCOVA. Both groups eligible: control group of nonparticipants who were eligible. 
  Sample: St. Louis public schools. 507 students who scored below 50th percentile on CPI who were eligible for attendance at extended day kindergarten. 273 enrolled; 234 control. 
  Treatment: Supplementary instruction for kindergarten pupils using small-group and individual instruction in extended day. 
  Effects (effect size), Comprehensive Test of Basic Skills: 
    Mathematics: .41 p<.001 
    Reading: .32 p<.001 

TABLE 5-1. (continued) 



Nonmatched Groups/Pre-Post Studies 



Study 








Effect 




Sample 


Treatment 


Effects 


Size 


Notes 


Nieman & Gastright 


351 kindergarten 


EXP=full-day K 


Pretest (Sept. 






(1981) 


students in 16 


(//=4J0) 


Kinder "Goal 






Existing sample 
Longitudinal 
Post only, with 
evidence of initial 


Cincinnati schools 
receiving Title I. 


CTL= half-day K 
(//H4I) 


card") 
Bochm (Dec. 


NS 




Full-day students 


EXP also had 


Kinder) 


.35 p<.00l 




had preschool 


preschool 


Metropolitan 






equivalence 


experience, 
half-day did not. 


** 


(April Kinder) 

Metropolitan 
(4th grade— 
70% sample) 

Metropolitan 
(8th grade— 
50% sample) 

Grade retention 

Special education 


.35 p<.00l 

,25 p<.0l 

.25 p<.0l 
.13 p<.01 
.25 p<.001 





Hatcher (1980) 

ANCOVA 
Ad hoc sample 






4 school districts in 
Texas. 2 having 
half-day K and 2 
having full-day K. 
60 students 
selected at 
random. 



Half-day vs. 
full-day. No 
information on 
curriculum oi*ton 
differences in 
treatments. 



Metropolitan 

Readiness 
California Test of 

Personality 
Valctt 

Developmental 
^ Survey Basic 



ns 



ns 



ns 



Adcock (1980) 



Jarvi.s and toolnar 
(1986) 



189 urban and rural 
kindergarten 
children in 5 
Maryland local 
cd. agencies. 

Comparison of 
existing full-day 
and half-day Ks. 



EXP=full- 

day(/i= 131) 
CTL=ha1f- 

day(«=58) 

Measures: 
Metropolitan 
(pre and post) 



Results ANCOVA 
Post = pre + K 
type 



.56 p<.00i 



Estimated setting 
t=3.09, minimum 
value for p<.C0l 



New York City. 
1807 full-day K 
223 half-day K 
Citywidc conversion 

to full-day K. 
Half-day were ones 

unable to convert. 



Contrasts: 
Half-day/ 

language 
Full-day/ 

language 

Measures: 
Brigancc 

Pre/Post 
LAB Pre/Post 



Results ANCOVA 
Brigancc 
English 

speakers 
Non-English 
speakers 
LAB 



.09 ns 

.45 p<.()5 
.38 p< .05 



Evans and Marken 


Metropolitan school 


Contrasts: 


Results ANCOVA 


(1984) 


district in Wash. 


Ful1-day(/;=87) 


CAT - ns 




state. Mostly 


Half-day(/i=87) 


Reading 




white. 

middle-class. 
174 1st. 2nd. 3rd in 
2 diff. clem, 
schools who had 
different 
kindergarten 
pgms. 


Measures: 
Ability test (K) 
CAT (1.2. or 3) 
Early Chd 
School 
Sentiment 
Teacher ratings 
Reading attitude 


attitudes ♦ 
Referral special 
education -.26 p<.05 




(continued) 



TABLE 5-1. (continued) 



Nonmatched Groups/Pre-Post Studies 


Study 


Sample 


Treatment 


Effects 


Effect 
Size 


Notes 


Dcrosia (1980) 


Jefferson City, CO. 

384 students in 
kindergarten. 1st 
and 2nd grades 
having full or 
half-day 


Full-day (/i=67) 
Half-day (/i=93) 


Boehm (adjusted 
for pretest, 
SES, 
age) 

CTBS (Grade I) 
CTBS (Grade 2) 


.36 p<.05 

ns 

ns 





kindergarten. 



Warjanka (1982) 



9 
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30 students who 


Six-month 


At pretest, FDK 


scored < 65 on 


treatment, regular 


group I 


Metropolitan 


kindergarten 


standard 


Readiness Test 


+ extended day 


deviation lower 


and 


Curriculum based 


than other 


40 students who 


on participant's 


group (37.8 vs. 


were in same K 


ability 


20.5) on 


classes with 




Metropolitan 


scores >65. 




Readiness Test 






After 6 months of 






treatment, EKD 






group and 






regular group 




were same 






(54.3) 



(+) 





70 siuaenis wno 


Additional 


Pre-post design. (+ ) 






u/nrp i nrlpnt ifif*fl 
Wvlt 1 IIUwIlllllCU 


instruction \ 1 i7 


At nrat net 

j\\ preicsi, rUK 






as at risk 


to 242 hours). 


group signifi- 






nnrl 


^mall/»r r*lncc/»c 


canny lower 






171 UIIICI l\ 


1 J. 


than reg. group 






students 


Curricular 
change — whole 


on CAT 
listening 








language 


skills subtest in 








approach 


NCEs (24 vs. 
45). At posttcst, 
FDK made 
significant gains; 
reg. i;roup de- 
clined (36 NCE 
to 42 NCE). 





Lysiak and Evans 
(1976) 

Convenience sample 
Replicated two years 



Fort Worth, TX. 
916 students in 1 1 1 
K classes 



Comparison of 6 
curricular 
models, for 
students of 
differing SES, 
ethnicity, and for 
full-& I /2-day 



Rill-day > 
half-day for low 
SES and for 
high SES. 



(♦) 



Alpcr and Wright 


Phoenix, AZ. 


Full-day had longer 


Metropol. Readi- ( * ) 7 


No significance 


(1979) 


98 students in 


day (5 vs. 2 1/2) 


ness Test 


levels computed 




kindergartens in 


and smaller 


Extended day > 




extended day and 


classes (12-25). 


regular 






regular 


Teacher visits to 


No report of 








homes. 


significance 








Three-month study. 


level 








(continued) 



TABLE 5-1. (continued) 



Nonmatched Groups/Pre-Post Studies 



Study 












Sample 


Treatment 


Effects 


Size 


Notes 


Humphrey 


Evansville- 


Contrasts; 


Reading Gates 




Significance not 


(1983) 


Vanderburg 


2 cohorts 


Mnpfiint ip 
iviaivjiiiUv 


7R 7Q t±\ 


reported 




School District 


78-79 full = 8! 

/O / / lull Ul 


V~ 1 DO 


/7— oil (*r ) 






half= 108 


\Jl uUv 


/o— /y { "v ) 








79-80 full= 1 15 


Retention 


78-79 (+) 








half=HI4 




199? Half ( + ) 












9 Full 




McClinton and 


80 1st graders in 10 


EXP=4'I5" 


CAT at end of K 


- ns 




Topping 


public schs 


CTL=2'40'' 


CAT at end of 1st 


- ns 




( IVo4) 


randomly 


Major difference 


Teacher 






Post only 


selected* 


was amount of 


ratings 






No evidence of initial 


EXP= enrolled 


time, not 


academic 






equivalence 


EKD 


cm rriculum 


ability 


1 n<' t\S 
1 i*f ». \f K - .Vi.i 




CTL= enrolled rcg 




FXP>CTI 












|'(| ,9) - 5,15 


















Harman (1982) 


55 half-day, 66 


Posttcst-only 


CAT: reading 


K21 ns 




Convenience sample 


full-day in K 


design. 


math 


.40 p',,05 




classes in same 


Comparison of CAT 










school and 


reading and math 










matched on 


at end of year. 










ethnicity. 




• *■ 







£ mobility, & SES 
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L^nicago s oovi 


1 10 schools. 


Contrasts: 


Percent scoring in 


Funded 


Comparison of 


Funding source: 


first quartile 11 BS 


Kindergarten 


existing programs 


Chapter 1 






Programs 




Chapter 2 


HDK, chpt 1, size 16 = 26% 


Convenience sample 




OEEO 

Board Funded 
Format: 

all-day 

half-day 
No pretests 


ADK, chpt 2, size 23 = 39% 
ADK size 26 = 46% 
OEEO size 28 = 51% 
HDK, board, size 28 = 73% 

FIGURE 5-2. Summary of Effects for Full-day Kindergarten by Quality of Study, 
Immediacy of Effect, and Population Studied 

Random Assignment or Matched 
  Regular or Advantaged Students 
    Kindergarten: Johnson +, Oliver +, Winter 0 
    Long term: Johnson 0, Winter 0 
  Disadvantaged Students 
    Kindergarten: Johnson 0, Winter + 
    Long term: Johnson 0, Winter 0 

Nonmatched 
  Regular or Advantaged Students 
    Kindergarten: Hatcher 0, Adcock +, Jarvis 0, Derosia + 
    Long term: Evans 0, Derosia 0 
  Disadvantaged Students 
    Kindergarten: Carapella +, Nieman +, Jarvis +, Warjanka +, Slaughter +, Lysiak +, Entwisle + 
    Long term: Nieman + 

into the less methodologically rigorous category, and all of these found 
positive effects for all-day kindergarten. 

There are no long-term effects demonstrated for attendance at full-day 
kindergarten. Only one study (Nieman and Gastright, 1981) found significant 
long-term effects, but limitations of this study reduce the credibility of the 
results. First, the study compared students who had preschool and all-day 
kindergarten to those in half-day programs, a somewhat different comparison 
than in the other studies. Second, their test for equivalence of the two 
groups at entry into kindergarten was of unknown validity and reliability. 
Finally, the long-term results in grades 4 and 8 included only 70 and 50 
percent of their initial samples, so differential sample attrition may have 
affected the results. 

Other studies focusing on the effects of compensatory efforts (Lazar, 
Hubbell, Murray, Rosche, and Royce, 1977; McKey, Condelli, Ganson, 
Barrett, McConkey, and Plantz, 1985) have found that the results of the 
extended day/year are primarily immediate and not long-term, and our 
findings support this conclusion concerning the effects of full-day 
kindergarten. 

The finding that full-day kindergarten programs seem most effective on 
short-term measures for disadvantaged populations raises many new ques- 
tions. To what extent is this finding due to differences in the sheer amount of 
time in school or due to differences in program emphasis and focus? It seems 
possible that a combination of more time and greater emphasis on academic 
preparation is important. Studies linking the allocation of time to differences 
in achievement typically find only modest effects (Karweit, 1983). 
One primary reason is that the same allocated time can have quite varied 
actual usages in different classrooms, depending on the grouping patterns, 
the curriculum, the teacher, and the students. 

An observational study of kindergarten instruction in three school 
districts by Meyer (1985) illustrates this point. Contrasting the use of time in 
districts that have half-day and full-day programs, Meyer showed that the 
actual amount of time on academic matters was not all that different in the 
full- and half-day programs observed. The total minutes allocated to instruction 
in the half-day classes (150-minute sessions) was 78 minutes, whereas in 
the full-day classes (330 minutes) the total instructional time allocated was 
103 minutes. In general, the students in the full-day programs had more total 
minutes allocated to instruction, but some teachers in the half-day schedule 
actually exceeded the allocated time of some teachers in the full-day 
schedule. Again, individual teacher practices and curricula seemed to be 
important elements in determining how the school day was spent. 
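
The allocation figures reported from Meyer's observations are easier to compare when instructional minutes are expressed as a share of the session. The short Python sketch below does only that arithmetic on the minute values quoted above; the function and variable names are illustrative and are not part of Meyer's study.

```python
def instructional_share(instruction_minutes, session_minutes):
    """Return the fraction of a session that is allocated to instruction."""
    if session_minutes <= 0:
        raise ValueError("session_minutes must be positive")
    return instruction_minutes / session_minutes


# Minute values quoted in the text for the half-day and full-day classes.
half_day_share = instructional_share(78, 150)
full_day_share = instructional_share(103, 330)

# How much of the added session time shows up as added instruction.
extra_session = 330 - 150
extra_instruction = 103 - 78

print(f"Half-day share of session: {half_day_share:.0%}")   # 52%
print(f"Full-day share of session: {full_day_share:.0%}")   # 31%
print(f"Extra instruction: {extra_instruction} of {extra_session} added minutes")
```

On these figures, the 180 additional minutes in the full-day session carried 25 additional minutes of allocated instruction.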

This suggests the importance of understanding more than the effects of 
the length of the kindergarten day. What instructional programs are effective 
for kindergarten students? What difficulties are there in operating these 
programs in a full-day or half-day setting? Is it possible to have effective 
half-day programs and thus save the considerable expense in expanding the 
kindergarten? And if districts decide to extend their kindergarten day, what 
programs have been demonstrated to be effective? Do they require a full day 
for successful implementation? 

The major conclusion from examining the effects of full-day kindergar- 
ten is that attendance at full-day programs appears to be beneficial for 
disadvantaged students. The source of this effect — whether it is simply more 
time in school or a change in the focus of the kindergarten program that 
accompanied a full day — is not clear. In the next section, we focus on the 
nature of the programs that seem to be effective for disadvantaged kinder- 
garten students. 

EFFECTS OF PROGRAMS OF INSTRUCTION 

There are two major sources for the programs reviewed here: programs 
approved by the U.S. Department of Education's Joint Dissemination 
Review Panel (JDRP), and programs listed in the Office of Educational 
Research and Improvement's Effective Compensatory Education Sourcebook 
(Griswold, Cotton, and Hansen, 1986). The studies/programs are 
classified and presented (see Table 5-2) by the adequacy of the research 
design. Programs that used random assignment to treatment and control 
groups were weighted most heavily, followed by studies that used a 
matched/experimental control group design. Cohort, or before- and 
after-implementation, designs are discussed next. Given the least weight are 
studies that base their evidence of effectiveness on comparisons of expected 
fall-to-spring growth, or on posttest-only effects. 

Assessments of effective early childhood programs not only suffer 
from all the methodological difficulties discussed for elementary studies (see 
Madden and Slavin, 1987), but include a few of their own. First, it is much 
less clear what the goals of kindergarten programs may be, and there are 
fewer reliable measures of the goals. Measures are often homegrown tests of 
unknown reliability. When tests of known reliability are used, there is little 
consensus about which to use. For example, the twenty JDRP-approved 
programs for kindergarten used twelve different standardized tests. 

Furthermore, the test selected may not match the objectives of the 
program. For example, in a school with an outstanding prekindergarten 
program, the principal was asked her primary goal for the prekindergarten 
students. She was very quick to respond, "To make them articulate." Yet 
no test or measure of the children's expressive language was used in 
evaluating the program's effectiveness. Instead, the program effects were 
measured using standard paper and pencil instruments. 

Also, the evaluations can render false positive effects if they measure 
skills that improve test-taking performance on narrow educational goals, but 
not skills that lay the foundation for future learning. For example, children 
can be taught to recognize numbers and count to fifty without having the 
necessary skills to master addition. Or children can learn at an early age to 
recognize letters, but still not possess the necessary language skills to learn 
how to read. An evaluation that shows mastery of discrete components 
related to reading and the separate skills related to math comprehension 
does not show competency in reading and math. There needs to be 
integration of the skills and the necessary linguistic and numeric background 
for the skills to be applied. 

Finally, kindergarten programs may have very different goals, so that 
comparison of treatment and control groups can be misleading. Comparing 
an academic kindergarten program to a traditional control group that is 
basically nonacademic in orientation, one should not be surprised to find 
large effects on readiness activities for the academic program, since the 
traditional program did not intend to teach these objectives. This problem of 
program goal is particularly at issue for kindergarten programs because the 
purpose of kindergarten has been changing over time and evaluations 
indicate very little about the curriculum for the control group. 

Table 5-2 provides a synopsis of the kindergarten programs for which 
we have evaluation data. As noted, these programs are presented by the 
adequacy of the research design in the following order: 

1. Random assignment 

2. Matched control group 

3. Cohort comparison 

4. Spring-to-spring growth (no control) 

5. Fall-to-spring growth (no control) 

We consider studies that used random assignment, matched control 
groups, or cohort comparison groups to be methodologically adequate 
studies. Effect sizes are presented for these more adequate studies. The less 
methodologically rigorous studies/programs are presented along with the 
general direction of effect. 
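
Because effect sizes carry much of the comparison among programs, a brief illustration may help. The Python sketch below assumes the conventional definition, the treatment-control difference in means divided by a standard deviation, together with the standard conversion from an independent-groups t statistic. The chapter does not spell out its formulas, so both functions, their names, and the example numbers are assumptions for illustration only and are not taken from any study reviewed here.

```python
import math


def effect_size_from_means(mean_treatment, mean_control, sd):
    """Standardized mean difference: (treatment mean - control mean) / SD.

    Which SD to use (control-group or pooled) is an assumption the
    analyst must state; this sketch simply takes it as an input.
    """
    if sd <= 0:
        raise ValueError("sd must be positive")
    return (mean_treatment - mean_control) / sd


def effect_size_from_t(t, n_treatment, n_control):
    """Convert an independent-groups t statistic to a standardized difference.

    Uses d = t * sqrt(1/n_t + 1/n_c). When a report gives only a
    significance level, the smallest t reaching that level can be
    supplied to obtain a conservative estimate.
    """
    if n_treatment <= 0 or n_control <= 0:
        raise ValueError("group sizes must be positive")
    return t * math.sqrt(1.0 / n_treatment + 1.0 / n_control)


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
d_from_means = effect_size_from_means(57.0, 50.0, 10.0)  # 7-point gap, SD of 10
d_from_t = effect_size_from_t(2.0, 50, 50)               # t = 2.0, 50 per group

print(f"From means: {d_from_means:.2f}")
print(f"From t:     {d_from_t:.2f}")
```

With these made-up inputs, a seven-point difference on a scale with a standard deviation of ten corresponds to an effect size of 0.70, and a t of 2.0 with fifty students per group corresponds to 0.40.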

Alphaphonics/Astra's Magic Math 

Alphaphonics and Astra's Magic Math are two widely used and successful 
beginning readiness programs. Combining systematic, sequenced lessons 
into a game-like format, both programs are motivating and fun yet still 
provide abundant practice and repetition of presentation. Friendly visitors 
from outer space (Astro for reading; Astra for math) leave a bag of lesson 
materials daily for the teacher and children. The suspense and anticipation 
derived from the magic bag appear to sustain student interest and 
motivation. 

In the Alphaphonics program, the letters of the alphabet are introduced 
sequentially, one at a time, in a twenty-six-week sequence. Astro's bag 
contains items that start with the letter the class is studying, such as an apple, 
a (plastic) alligator, an alarm clock, and an abacus for the letter A. Badges, 
stickers, and letters to parents are also in the bag. 

There are six lessons for each letter of the alphabet. The student learns 
to name a letter, then to write it, and then to locate the upper- and 
lower-case examples of the letter. Astro manages to create and keep a playful 
fantasy atmosphere for the children while getting them to practice and 
review. Astro also brings ditto sheets or other tools for independent 
practice. 

Astra's Magic Math uses a similar outer space theme for the introduction 
of twenty-two math concepts in a sequenced manner. The units are 
introduced to the whole class. The twenty-two units cover shapes, match- 
ing, size comparison, counting and recognition of the numbers 0 to 30, 
number sequences, addition and subtraction of the numerals 0 to 5, and time 
in hours. 

The Alphaphonics program takes about an hour each day. A typical 
schedule would be 

1. Sing alphabet song 

2. Sing poem song (for particular letter group) 

3. Class discussion 

4. Individual work 

5. Sing poem song 

The evaluation of the program used a posttest-only analysis of variance. 
The treatment classrooms were compared with twelve control schools 

TABLE 5-2. Effective Kindergarten Programs 

Programs Evaluated with Random Assignment or Matched Control Group Design 

JDRP 74-15: Alphaphonics 
  Developer: So. San Francisco U.S.C. 
  Grade: K 
  Content: Reading Readiness 
  Instructional Strategy: Readiness phonics program focusing on sequential learning, immediate correction, feedback, & game-like presentation for about 1 hr/day. 
  Evaluation Design: Post ANOVA on treatment sch. & remaining 12 in district. Stated that IQ & background of T & C equivalent. All students there for K-3. 
  Measures and Effects: Metropolitan Ach.: readiness, .89; ach. G1, 1.14; ach. G2, .90; ach. G3, 1.07 
  Costs & Training: $135 for materials/classroom. 1-day trng; pay trnr honorarium & travel. 
  Adoptions & Activity: 6000 cities, 50 states 

JDRP 83-54: Astra Math 
  Developer: So. San Francisco U.S.C. 
  Grade: K 
  Content: Math Readiness 
  Instructional Strategy: Comprehensive, structured, & sequenced curriculum with 22 self-contained units. Uses multisensory approach & behavior modification & high-interest materials. 
  Evaluation Design: Pre-post random assignment to treatment and control, 3 classes each. 
  Measures and Effects: CTBS fall-sprg: .45 (adj); .30 (not adj) 
  Costs & Training: $112 for materials/classroom. 1-day trng. 
  Adoptions & Activity: 4000 cities, 30 states 

JDRP 77-116: MECCA 
  Developer: Trumbull Public Schools (CT) 
  Grade: K 
  Instructional Strategy: Development & implementation of early identification procedures & prescriptive educational programs for children entering K with specific potential handicaps. 
  Evaluation Design: Pre-post random assignment to trmt & control. 
  Measures and Effects: JANSKY, .67 and .57; Metropol., .88; Monroe, .96 
  Costs & Training: No data 
  Adoptions & Activity: No data 

(continued) 
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TABLE 5-2. (continued) 



Programs Evaluated with Random Assignment or 
Matched Control Group Design 



JDRP 



Name 



Developer Grade Content 



Instructional 
Strategy 



7K-IK9 TALK 



79-7 MARC 



Wakulla City. 
Crawford- 
villc. KL 



75-37R INSTRUCT 1. 



Rockford, IL, K-3 Lang, Lang, specialist in 
school class instruction 

system in listening skills 

4 wk 1/2 hour for 
6 mos then 
classroom tchr 
continues 
lessons. 

K-l Rdg. Continuous 

progress using 
multisensory 
activities & 
systematic 
instruction. 
Diagnostic & 
recordkeeping 
instrument, skill 
sheets provided. 

ncoln Public K-3 Rdg, Individual 
Schools placement & 

progress through 
multiunit model 



79-38 PLAY Bristol. VA K-l and Motor/ Diagnostic/ 

3/4 Cogni- prescriptive 
years live direct instruction 

in perceptual/ 
motor monthly 
home 

reinforcement & 
activities. 
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Evaluation 






Costs & 


Adoptions 


Design 


Measures 


Effects 


Training 


& Activity 


p r ~ w i a \icr\\f a 

rTc-pOSl AiNl^UVA 






C<n manual 


572 dists. 


Ull ilCallllClll oc 


76 


.42 


tchr & 


33 states 


matched local 


WISC 75 


.38 


sub time 


6 int'l 


control. 


76 


.46 


for tchr to 




Original study: 


PPVT 75 


.26 <K) 


attend 


Currently 


(75-7f* 


76 


.74 


I /2-day 


active 


Replication: (76-77) 


(K) 75 


.38 (K) 


irng 






76 


.55 






Post ANOVA on 


SESAT (II) 


#! #2 


3/5 day trng 


No exact data 


treatment & 


letters 


I. 12 .55 


or 


(FL&SC) 


matched local 


word rdg 


.88 na 


sclf-trng 


Not active at 


control at end of 


sent rdg 


.25 na 


manual 


present 



K & end of 1st. 
Pre ANOVA to 
ensure 
equivalence. 



BOEHM 

KUHLMAN 
ANDER- 
SON 

SESftT(l) 



ns 



ns 
ns 



ANCOVA on trmt & 
comparable schs. 
chosen on similar 
SES. school org & 
#s compensatory 
students. 

ANOVA on 
treatment and 
control. Control 
were cligiblcs 
(score below cut 
off) not enrolled 
because positions 
filled. 



Metropolitan 
word know 
rdg. 

spelling 



BOEHM 

75- 76 

76- 77 

77- 78 



.35** 
.25* 1 
ns 



1.77 
.23 
1.33 



5 days trng No data 

materials 
SlOO/class 



na 



Not active 



(continued) 
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TABLE 5-2. (continued) 



Programs Evaluated by Comparison with 
Expected Growth, National Norms, or Fall-to-Spring Growth 

JDRP   Name   Developer   Grade   Content   Instructional Strategy 


81-44 


CLIMB 


M iddlesex. 


K-12 


Rdg. 


Diagnostic/ 






NJ 




Math 


prescription 












approach in 












acquisition of 












rdg Si math 












skills 












providing a 












mngmt design 












for 












coordinating 












& integrating 












classroom & 












support 












personnel. 


76-87 


STAMM 


Lakcuood. 


K-8 


Math 


Continuous 






CO 






progress math 












with mngmt 












system. 


78- 184 


lulucntion 


Kenosha. 


K-IO 


Lang. 


Kuended'day K 




assessmt. & 


Wl 






2-3 hrs in 




Inst, for the 








afternoon. 




Educa- 








additional 




tionally 








time for 




Deprived 








remedial 












instruction. 


78-198 


livery Student 


St. Marv 


K-4 


Math 


Daily diagnosis. 




livery l)a> 


Parish Sch. 


K-6 


Rilg. 


evaluation & 






Bd. 






prescription. 












computer 












scoring for 












coordination. 












PuIIot design 












using 40 ruins 












each day. 



0 
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Evaluation 
Design 


Measure* Effects 


Costs & 
Training 


Adoptions 
iV- Activity 


Spring-to-spring 
achievement 


CTBS + 




781 (1982-6) 
from sponsor 



compared to 
nat*I norms & 
compensatory 
growth. 



Pre-post 
implementation 
scores for 
district & 
adoption site. 



CAI 



# stdts x 7, avg 
startup. # stdts 
x c if wkbks 
used as 
consumables 



41 states: 15(H) 
adoptions 



Pre-post design, 
f-all-to-spring. 



PPVT 



Pre-post design TOBIi 
changing C A tile § (presch. 
fall-to-spring & K) 
7 52(76) 
7 59(77) 
7 32(82) 
2 40(83) 



(continued) 
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TABLE 5-2. (continued) 



Programs Evaluated by Comparison with 
Expected Growth, National Norms, or Fall-to-Spring Growth 

JDRP   Name   Developer   Grade   Content   Instructional Strategy 

74-102 Baptist Hill K Baptist Hill K. K Rdg. Full-day K. Irng 

Greenville. Math eirs. 

AL diagnoses 

individual Irng 

needs on 

continuous 

basis with 

appropriate 

learning 

activities. 

74-46R l-arly Peolone 4. 5. 6 I-nrly 

Prevention District year identification 

of School Illinois olds of 

Failure developmental 

needs & Irng 
styles of 4. 5. 
& 6 yr olds. 



Screening, 
planning. & 
pullout 20-30 
min 

instruction in 
different 
modalities in 
Irng ctrs. 






Evaluation CtnU <* Adoptions 

Design Measures Eff ects Training <&. Activity 

Pre-post design. TOBE * No data 

l ; ;t!l-to-spring. 

3rd 6th 
stanine stanine 



Improvement per getting • 2-day ting MMMMfiom 

month on data sponsor) 

different SI27/clsrm teacher 

scales— no materials 
comparison \ 
data cither w/ 
a control grp 
or prcimple- 
mentation 
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TABLE 5-2. (continued) 



JDRP 



Programs Evaluated by Comparison with 
Expected Growth, National Norms, or Fall-to-Spring Growth 



Name 



Developer Grade Content 



Instructional 
Strategy 



84-1 



74-71 



74-75 



First Level 
Math 



PRIMAK 
Educational 
l : ndt. 



New Moore l-lem. 

Adventure Sen., 
in Learning Tallahassee. 
IL 



Strategies in 
Karly 
Childhood 
Education 



Waiipun. Wl 



K or I M;ith 



pieK 
and K 



Sequential 
curriculum & 
mngmt system 
which is 
diagnostic/ 
prescriptive. 
Instructional 
grps formed 
on basis of 
pretests. 
Instruction in 
3-4 grps for 
about 20-30 
mins. 

Individually 
determined 
instruction 
with positive 
behavior 
mngmt. 

Developmental 
& screening 
model. Self- 
instructional, 
individually 
paced . 

learning clrs. 
develop- 
mentally 
sequenced 
materials. 
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Evaluation 
Design 


Measures 


Elicits 


Casts & 
Training 


Adaptions 
& Activity 


Pre-post design 


CIRCUS 


+ 


$3S/kit 





Fall-to-spring. not 

possible 
to 

compute 



Pre-post nat'l PPVT— mean imprvmt No chit, data No cut r. data 

norm 1.67/math Gilmore 

comparison oral rdg test — 

using expected on grade level at pre, 
growth. 579$ at post 



Ad hoc 10 sessions 

comparison of in service 

trtmt children 
with another 
grp. 

No evidence of 
pi 101 

eompuiuhiliH 
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TABLE 5-2. (continued) 



Programs Evaluated by Comparison with 
Expected Growth, National Norms, or Fall-to-Spring Growth 



JDRP   Name   Location   Serves   Area   Description 



74-93 Right to Read Glassboio. K-1 Reading Diagnostic. 

piesctiplive, indiv. 
piogiess model, 
ungiadcd. 



Project Catch 
Up 



Newport 
Mesa, 
CA 



Amphitheater Tucson. 
School AZ 
District 

KIP 



VIP 



Spokane. 
WA 



K-6 Reading Remedial distinction in 
Math rdg & math to 

underachieving 
students using 
diagnoses, 
prescription 

Positive contacts with 
family. 

K Parent involvement 

once a wk training of 
parents in game or 
activity that gives 
practice in basic 
skills with followup 
practice with 
students who need 
practice in that skill 
& monitoring of 
student progress. 

K Develop Develop friendly 

Skills feeling parents & 
school, provide 
training for parents 
in how to help 
children at home, to 
send home games 
which reinforce 
skills learned at 
school 
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Evaluation Test Effects Cost Adoption 



No control 325 children pre-post 


CRI 
Classrm 
Rdg. 
Inv. 


avg gain 
1.52 
yrs 


20 hrs 
inser- 
vice + 
1 hr/ 
wk 


No data 


Mean gain by grade on CTBS 
from fall to spring 


CTBS 
Fall- 
spring 








No data on K 










Comparison to comparable school 
on percent scoring above 50 
%ttle I year after 


CAT 


66tf vs. 
W/f 


No data 


No data 


Santa Clara Inv. gain 2.32 mos. 
in dev. age/month 


Santa 
Clara 
Inv. 








No control grp 


Note 
problem 
with 
fall*spg 









1 Effect sizes were computed by determining the t value to generate p<.01 and .05, 
respectively. 

in the same district, which did not differ with respect to entering IQ or SES. 
The achievements of treatment classes and control classes were measured at 
the end of kindergarten, first, second, and third grade using the Metropolitan 
Achievement Test. The effect sizes were 0.89, 1.14, 0.90, and 1.1, respectively, 
for these grades. Data for the equivalence of control/experimental 
classes and method of assignment to treatment were not detailed. 

The evaluation of Astra's Magic Math used random assignment to 
treatment and control classes. The effectiveness of the program was gauged 
by performance on the CTBS. The effect size computed here was 0.45 at the 
end of kindergarten. 

The evaluations do not indicate very much about the goals and 
practices of the comparison group. It is likely that the comparison 
classrooms may have been traditional kindergarten programs that include 
readiness activities, but not in a systematic fashion that assures coverage, 
practice of skills, and teacher feedback. 

There is no evidence supplied that the program is equally effective for 
all students or, on the other hand, that it is not effective for students at risk 
of failure. The general orientation of the program is that of a whole class, 
direct instruction model with individual assistance and remediation provided 
as can be accommodated. That is, remediation efforts are not structured in 
any specific way in these programs. 

MECCA 

MECCA (Make Every Child Capable of Achieving) is a diagnostic/ 
prescriptive program that provides daily observation, assessment, and 
planning for specialized teaching depending on children's needs. Additional 
instruction within the classroom is provided based on students' learning 
profiles. The additional activities are based on a task analysis of the learning 
activity with which the student is having difficulty. Task analysis is the 
process of breaking down a learning activity into the steps necessary for its 
successful completion, such as breaking down the activity into its auditory, 
visual, and gross and fine motor components. Specialized instruction, 
prescribed by a team composed of classroom aide, learning disabilities 
specialist, and classroom teacher, is provided either individually or in small 
groups in the classroom. 

The target group of children is identified by a preschool screening with 
the school psychologist and a speech and language clinician. High-risk 
children are those who do not attain age appropriate scores on three/four 
areas on the DIAL taken in the spring. In September, further assessment is 
done on marginal and high-risk students and an individual programming 
survey is administered to identify particular strengths and weaknesses. 
From this profile, an educational plan is devised. 

For example, a child who has a very limited use of expressive 
language, such as single-word responses to questions or no usage of 
pronouns or adverbs, might be placed in a speech and language program. In 
the beginning of the program, the child would be rewarded continuously 
with praise or with tokens. The basic structure of each lesson is: 

1. Auditory Reception: The instructor gives verbal directions to which the 
student responds with a gross motor or fine motor action ("Pick up the 
tomato"). 

2. Verbalization: The child uses the language she or he has responded to in 
step 1 ("Here is the tomato"). 

3. Reading readiness activities are combined with a lesson using the words
and sounds from the first two parts.

The lessons are structured to give the child practice in increasingly
more difficult auditory reception, memory, and other readiness skills.

Eligible students were randomly assigned to the MECCA treatment or 
a control group. The Jansky Predictive Screening Index, an individually 
administered twenty-minute test, was used as the pretest and one of the 
posttest measures. There are five predicting tests: letter naming, picture
naming, Gates word matching, Bender motor Gestalt, and Binet Sentence
Memory. The posttests used were the Jansky and the Metropolitan
Readiness Test.

The comparisons found no significant differences in pretest scores 
between the thirty-seven students in the MECCA program and the
thirty-three in the control classes. At the posttest, the MECCA group
outperformed the comparison group by about seven points (effect size = +.67).
Similar effects were found on the Metropolitan Readiness Posttest, where 
the effect size was 0.88. 
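
The effect sizes reported in this chapter can be read as standardized mean differences. The following is a minimal sketch, assuming the common convention of dividing the experimental-control difference in posttest means by the control group's standard deviation; the source does not state which formula the evaluators used. Under that assumption, a seven-point difference with an effect size of +.67 implies a standard deviation of roughly 10.4 points. The function names and the sample scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def effect_size(treatment_scores, control_scores):
    """Standardized mean difference: (mean_T - mean_C) / SD of the control group.

    Assumption: the control-group standard deviation is the denominator.
    """
    return (mean(treatment_scores) - mean(control_scores)) / stdev(control_scores)


def implied_sd(mean_difference, reported_effect_size):
    """Back out the standard deviation implied by a reported difference and effect size."""
    return mean_difference / reported_effect_size


# Hypothetical posttest scores, for illustration only.
treatment = [52, 58, 61, 47, 66]
control = [45, 50, 55, 40, 60]

print(round(effect_size(treatment, control), 2))

# Standard deviation implied by the reported seven-point Jansky difference.
print(round(implied_sd(7, 0.67), 1))
```

Reading an effect size this way makes results on different tests comparable, since each is expressed in standard deviation units rather than raw points.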

Another comparison between students randomly assigned to MECCA 
and a "multidisciplinary" comparison group shows similar results (Jansky 
effect size = .57, Monroe Reading Aptitude Test effect size = .96). In the
comparison classrooms, the children were taught by an LD teacher and 
three other specialists. 

The replication of effects under the multidisciplinary comparison is 
significant, for it suggests that the power of MECCA arises from more than 
its use of specialized personnel. It suggests that the curriculum, materials, 
and approach are important factors in MECCA's effectiveness. Screening,
diagnoses, and task analysis of learning activities target the time and 
resources within the school in a productive way, especially for students very 
much at risk of future failure. 

TALK 

The focus of Project TALK is to improve expressive and receptive language 
skills in children in grades K-3. This is accomplished by structured activities 
that foster language growth. A language specialist teaches specific
expressive and receptive language lessons to the class twice a week for
half an hour over a six-month period. The classroom teacher watches and 
then participates in the demonstration lessons, and conducts followup 
lessons twice each week. 

The following sample indicates the type of lesson used in TALK. 

Lesson 92 

TITLE:

Describe All

PURPOSE:

To encourage use of descriptive words of color, size, shape, and quantity.

MATERIALS:

List of simple descriptive words (descriptive word list)

PROCEDURE:

The teacher walks through the class, stopping here and there by a child. The 
teacher describes the child with one word. For instance, "Blond John,"
"Listening Susan," "Tired Billy," and so on. The teacher may build from this
by continuing and adding more descriptive words like "Pretty, blonde
Sherry," "Clever, old Johnny," and so on.

Now the teacher asks the class what has been happening. A short discussion 
of descriptions and descriptive words should follow. How do you describe 
things? What kinds of words do you use to describe things? You use words that 
tell size, shape, color, smell, taste, feelings, and so on. 

The teacher will begin by standing and telling one word about herself or
himself such as: tall, big, teacher, woman, and so on. Now each child must
stand one at a time and think of one word or a phrase to tell about him or her,
such as: little, red hair, freckles, braids, and so on. When everyone has had a
chance to tell a word then everyone can have another turn and think of still
another word that tells about him or her. When a child gives a self-descriptive
word that might fit another child in the room, stop and discuss it (How many
people fit Mark's word?); this will broaden the lesson.

Younger children may want to pick a favorite self-descriptive word. The 
teacher can make a card for the child to wear pinned on all day that describes 
him or her. This will give the child a chance to explain his or her word to people 
who ask about it. 

MORE IDEAS: 

The teacher picks a simple descriptive word, such as hard, and each child in 
the class must find an object in the room that fits the descriptive word, such as
"hard floor," "hard desk," "hard pencil," "hard window pane," and so on.
The teacher chooses one word from a box of slips with simple descriptive 
words written on them. Each student finds an item in the classroom that fits the 
descriptive word. Now the children take turns drawing a descriptive word and 
finding objects that fit on his or her own. For a more difficult twist, especially in
the upper grades, have children draw two or three descriptive words and place
them in the proper sentence order, such as a big, red ___ and not a red,
big ___, or two small ___ and not small two ___.

Three experimental and control schools were compared. Target groups
of twenty-six students were randomly selected for pretesting from each 
grade level (K-3). In Table 5-2, we provide two sets of results: for overall 
(K-3) and for K alone. 

The Peabody Picture Vocabulary Test (PPVT) was used to measure
receptive oral vocabulary. The expressive measure used was the vocabulary
subtest of the Wechsler Intelligence Scale (WISC).

Results for the original experiment and a replication the following year 
gave effect sizes (across all grades) around 0.38. Results presented for the 
separate grades indicate that the program was as effective or more effective 
for kindergarten students as for first- through third-grade students. 

The amount of actual time spent on the Project TALK activities is 
small compared to the results obtained. Only two hours per week are
actually spent on the program. If the results are generally replicable, this
program seems to be particularly powerful in its effects. Additional
evaluations need to be conducted to learn if these effect sizes are generalizable.



MARC 

Multisensory Approach to Reading and Reading Readiness Curriculum, or
MARC, is a continuous-progress K-1 reading program. It combines
activities that emphasize knowledge through the senses with a systematic
instructional delivery and management system. It is designed to enhance the 
likelihood of success in the beginning reading task for students from 
low-income and/or rural backgrounds. The major goal of the program for 
kindergarten students is to increase their readiness skills, especially in the 
area of letter recognition and auditory perception of beginning sounds. 

For example, introduction of a letter of the alphabet involves the use of 
visual, auditory, and kinesthetic avenues of learning. The MARC program 
specifies the steps to be followed when introducing letters. These steps are
called linkages because they link the visual, auditory, and kinesthetic
approaches. For example, the steps in this program in teaching the letter
a are:

1. Teacher shows children the letter from the drill pack and says, "This is
the letter a." (VISUAL) 

2. The teacher asks the children to give the letter name. (AUDITORY) 

3. Children repeat the letter name. (AUDITORY) 

4. Teacher presents the key word (apple) and introduces the sound by the 
key word: "A is for apple." (AUDITORY, VISUAL) 

5. The children repeat the key word and sound while the teacher describes 
how it feels in the mouth and throat. (AUDITORY) 
6. The teacher asks the children to place their hands on their throat while
repeating the letter name, sound, and key word to "feel" the sound 
(KINESTHETIC) 

The project was evaluated by comparing an experimental group with a 
matched control group. The MARC and control classes did not differ 
significantly on pretests given in September of the kindergarten year 
(Boehm, Kuhlmann-Anderson IQ; SESAT I: Letters and Sounds or Aural
Comprehension). At the beginning of the next school year, the MARC 
students scored significantly higher than the control on tests of letters and 
sounds and word reading. The effect sizes obtained were +1.12 for letters
and +0.88 for word recognition. Because these effects were obtained for
students in a fall-to-fall testing, they are not as likely to have resulted solely
from short-term acquisition of test-taking skills or from artifacts of the tests
themselves, as in fall-to-spring testing. A replication study also produced
significant effects for the MARC group, although the effect size was smaller
(+0.55).

The MARC program is not currently funded by the National Diffusion
Network, although materials are still available through the Florida
Educational Resource for a nominal fee. The program has been successfully used
in Florida in about one-third of the counties and is still being used actively in
South Carolina. Project MARC has been used as a remedial program for
older students as well as a regular readiness program. The inservice guide
provides thorough coverage of the instructional program, assessments,
grouping, and program philosophy.



First Level Mathematics 

First Level Mathematics is a continuous-progress, developmentally oriented
entering mathematics program. It offers a sequential curriculum and
management system that provides for individual developmental growth. It is
a diagnostic/prescriptive program. Children take a placement test to deter- 
mine where they will be placed for instruction. Instruction may take place in 
small groups or individually. 

The program recognizes that many early math programs require fine
motor (writing numerals) and visual skills that have little to do with
mathematics skill development. First Level Math does not require these fine motor
skills. It teaches the children the universal language of math with concrete
objects and actual physical operations, and progress is made in small steps
as the child's concepts are formed.

Evaluation data are based on norm expectancy comparisons only, 
which unfortunately do not provide very strong evidence of success.

Early Prevention of School Failure



This program provides developmental screening, diagnosis, and training 
based on identified learning styles and modalities. Screening of four-, five-, 
and six-year-olds is carried out in fine and gross motor, auditory, visual, and
language areas using a variety of instruments: the Preschool Language Scale
(PLS, developed in conjunction with the program), the Peabody Picture
Vocabulary Test (PPVT), and the Developmental Test of Visual-Motor
Integration. On the basis of these tests, a profile is created for each child
which identifies his or her strengths and weaknesses. Students who are two
years or more below expectancy on two modalities are classified as high
risk; those one year or more below are classified as medium risk.
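
The classification rule just described can be sketched in a few lines. This is an illustration under stated assumptions, not the program's actual scoring procedure: the text does not say whether the medium-risk cutoff also requires two modalities, so the sketch assumes it does, and the label "low" for all remaining students, the function name, and the modality keys are hypothetical.

```python
def classify_risk(years_below_by_modality):
    """Classify a student from a mapping of modality -> years below expectancy.

    Assumptions (not stated in the source): the medium-risk cutoff also
    applies to two modalities, and all remaining students are labeled "low".
    """
    delays = list(years_below_by_modality.values())
    two_or_more_years = sum(1 for years in delays if years >= 2)
    one_or_more_years = sum(1 for years in delays if years >= 1)

    if two_or_more_years >= 2:
        return "high"
    if one_or_more_years >= 2:
        return "medium"
    return "low"


# Hypothetical profile: two modalities are two or more years below expectancy.
profile = {"language": 2.5, "auditory": 2.0, "visual": 0.5,
           "fine motor": 0.0, "gross motor": 1.0}
print(classify_risk(profile))
```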

Students are given additional instruction in their weak areas in a 
pullout setting. The program presents guides for direct modality instruction 
in the areas of language, auditory, visual, and fine and gross motor. Student 
profiles indicate their performance in these areas on a scale from 1 to 5.

Guides for modality instruction include correlation with major texts 
and breaking down a specific skill into a sequenced set of prerequisite skills. 
For example, if the skill is to tell stories in sequence with/without the aid of 
pictures, nine distinct skill performance areas are checked:

1. Child arranges picture stories in sequence. 

2. Child tells story using sequence cards. 

3. Child tells story in parts before retelling entire story. 

4. Child uses felt pieces or fingerplay to tell story.

5. Child uses pictures to tell story. 

6. Child uses assorted toys and objects to tell story. 

7. Child unscrambles story. 

8. Child gives a narrative to the series of drawn pictures.

9. Child uses puppet to help tell a story. 



The JDRP evaluations do not in general address the issue of a program's 
effectiveness for special populations. Data are not routinely presented that
detail the progress of students by race, sex, SES, or entering ability/
achievement level. Thus these JDRP data are not ideally suited for
addressing the question with which we began this chapter.

However, as a basis for improving practices for students at risk in the 
kindergarten years, this data base is a worthwhile place to start. It seems 
likely that sound instructional programs, with demonstrated effectiveness
across several sites, will be effective for most students. Analyses of
aptitude-by-treatment interactions from studies of later grades do not find
many interaction effects; programs effective for one subgroup tend to be
effective for others (see Slavin and Madden, 1987). On the other hand, it
may be that the nature of differences between the backgrounds of disadvan- 
taged and advantaged children as they enter school would make programs 
differentially effective for different subgroups at the kindergarten level. 

There is some evidence (Lysiak and Evans, 1976) of interaction effects
for program and SES background. In this examination of the effectiveness of 
different kindergarten programs, they found that the lower SES students 
benefitted in particular from a structured curricular approach. This finding is 
consistent with the common wisdom about the need for structure for 
disadvantaged students, although it is really not clear what structure means. 
On one hand, structure is conceived of as the opposite of the open 
classroom, itself an ill-defined intervention. On the other hand, structure is 
thought of as rigid and heavily prescribed. Semantics aside, structure in the
sense of a systematic approach to instructional delivery is a vital ingredient
for any effective program. What is striking about the kindergarten
approaches here, which encompass a wide variety of philosophies, is the
extent of the specificity of activities, planning, and goals. Effective programs
are ones that are detailed and specific.

The systematic aspects of programs may be more important in
effectiveness than are the philosophical aspects. Is this a program that can be
implemented on a day-to-day basis by a regular teacher facing thirty
students? We do find successful systematic approaches that encompass
quite different philosophies. For example, with respect to the degree of
individualization and attention to individual differences, there are effective
programs that focus basically on providing whole-class instruction (that is,
that do not differentiate instructional pace, delivery, or content), and there
are effective programs that have as their basic premise the need to focus on
individual strengths and weaknesses.

This contrast is seen in the approaches taken by Early Prevention of 
School Failure and MARC. Early Prevention of School Failure diagnoses 
modality strengths and weaknesses and tailors instruction to these
modalities. MARC underscores the significance of different avenues of
learning but does not differentiate instruction for students based on their
modality profiles. Both programs assume that an appropriate
kindergarten program is multisensory in its approach: EPSF customizes
instruction or provides modality training for those below a certain cutoff point;
MARC structures every lesson to include linkages among the different
avenues of learning. Not only are these programs quite different in their
philosophical approach to learning style differences, they require quite
different staff and support personnel as well. A team of specialists including
a psychologist and speech clinician is required to evaluate and screen all
students in the EPSF model. Instruction is then provided in a pullout
format to address modality weaknesses, such as practice in fine motor
coordination.

Thus, although different approaches may be effective, effective kinder- 
garten practices incorporate specific materials, management plans, activi- 
ties, and structures. The teachers have an instructional plan that they follow 
and specific activities that make sense in the context of that plan. The 
programs are not overly rigid, nor do they reduce teachers to automatons,
but they are specific. Such specificity is needed to ensure a faithful
implementation of a program. 



We have evaluations of reasonable adequacy that compare programs to 
control classes. However, we do not have data about the relative effective- 
ness of these different approaches or their effectiveness for different 
students. Is a program of screening and instruction in specific modalities 
more effective than a regular class-paced approach? What are the relative
costs and implementation difficulties of the different approaches? These 
basic data are needed to make intelligent decisions about approaches to the 
kindergarten year, to build on our collective past experiences in a
systematic way. Otherwise, the present practice of individual districts building their
individual curriculum without benefitting from the successes and failures of
other locations seems likely to continue. Improving educational practice can
be a cumulative effort, but it requires sustained and systematic evaluations.
As the kindergarten has become the first formal academic experience for
most students, the need to understand effective practices for this critical
stage in students' schooling is great.

Over the past decade, the U.S. educational system has been the focus of
intense scrutiny. By the end of 1984, thirty national reports had been written
and over 300 state task forces had been organized to examine issues of
public education (Cross, 1984). The "parent" report, "A Nation at Risk,"
has become both a ubiquitous reference in opening paragraphs of current
educational literature and a household phrase.

This exposure has spawned numerous calls for reform. Plans to reform 
our schools have come from all levels of government (local, state, and
federal) and from a variety of stakeholders in the educational process:
educational researchers, teacher educators, and teachers themselves. An
underlying assumption of many of the reform plans is that the central
problem is one of standards, which many critics claim are
unacceptably low. Hence, suggested remedies often focus on raising standards
and on making instruction more demanding and more uniform for all
students. The danger in this focus is that, unchecked, it may result in
increased failures of those least able to meet the new standards, and may
provide a rationale for the funneling of a disproportionate amount of future
resources toward the more gifted students, those who will likely rise to the
occasion of higher standards. What is needed is a parallel focus on the 
provision of quality and equitable schooling experiences on which higher 
expectations for all students could justifiably be based. 

A significant percentage of our nation's school-age population contin- 
ues to fail to learn the most basic literacy skills necessary for informed and 
responsible participation in our society. Furthermore, without the prere- 
quisite reading and math skills, such individuals find it difficult to master the 
relatively higher-level skills necessary for the increasing number of techno- 
logical jobs available in our society. Given the predicted demise of unskilled 
occupations, such individuals are doomed to become economic prisoners of
their low educational achievement. Moreover, evidence suggests that the 
number of such imprisoned individuals will steadily increase. Relatively high 
proportions of low-SES and minority children in today's schools have 
academic deficits in need of remediation. An analysis of demographic data, 
reported by Reynolds and Lakin (1987), projects that the number of children 
under age eighteen who are nonwhite and who are poor will grow at a
disproportionately fast rate over the next two decades. The predicted 
remedial needs of these children, of course, will be reflective of social and 
economic factors beyond their control. Nevertheless, the needs are no less
pressing. The schools must accept the responsibility to provide these 
children with the most effective academic education possible, yet the 
increasing number of children likely to be in need of such remediation can be 
expected to put additional strain on an already overburdened special and 
compensatory educational system. 

This problem demands a response. Our response can be reactive or 
proactive; it can be driven by political, economic, and/or demographic 
factors, or it can be driven by an educational knowledge base gleaned from 
reflections upon practice and research. A proactive, educationally informed 
response requires taking stock of what we know and what we still need to
know about educating children for whom traditional forms of schooling have 
been less than satisfactory. Hopefully, the chapters in this book make 
strides in that direction. 

This chapter also sets out to explore that territory. It does so with a
focus on variables related to instruction and learning. Through this lens, we
will review the changing responses to low-achieving children already in
practice as well as promising research findings upon which we might base
future instructional practices. The chapter is organized into three major
sections: (1) a brief overview of the historical responses to low achievement,
(2) a description of current practice, and (3) a discussion and analysis of the 
instructional knowledge base that is needed to inform more effective 
responses to low achievement.

The perspective from which this chapter is written builds upon past
analyses (Leinhardt, Bickel, and Pallay, 1982) and includes two
assumptions. First, what is effective instructionally with one type of low-achieving
student tends to be effective with other types; hence, the distinctions
between compensatory education and the various categories of special
education for the mildly handicapped are not driven by educational
necessities. Second, due to lack of evidence that restrictive settings promote higher
student achievement than regular classrooms, and the moral and social 
problems associated with those restrictive settings, students should be 
educated in the least restrictive environment possible. We take this as both a 
moral imperative and a legal one (in light of PL 94-142). In this chapter, we 
are concerned with the educational conditions that will most effectively 
promote positive academic outcomes for compensatory education students, 
for those categories of special education students traditionally referred to as 
mildly handicapped (learning disabled [LD], socio-emotionally disturbed
[SED], and educably mentally retarded [EMR]), and for unidentified "slow
learners" (the bottom 20-25 percent of the achievement distribution). 

HISTORICAL RESPONSES TO 
LOW ACHIEVEMENT 

Large-scale problems in learning have been recognized since the turn of the 
century when the passage of compulsory education laws drew children with 
extremely diverse ability and motivational levels into the public schools.
Initially it was felt that the provision of equal educational exposure was
sufficient, and that early school exit was an individual decision or failure.
Gradually, this position changed. The next prominent strategy was to
identify children for whom learning in traditional classrooms was difficult
and remove them to sheltered special education settings. This strategy was
reflective of a concern (manifested by parents and educators alike) that
participation in regular schools was harmful to these children. The belief and
evidence was that handicapped children suffered rejection from their peers, 
experienced frustration due to failure on routine academic tasks, and built 
up expectations of future school failure. Instruction provided in a warm, 
nurturing environment by specially trained teachers, it was thought, would 
be much more beneficial to both their academic and emotional welfare. 

Beginning in the early 1960s, individuals began to exhibit concern 
over the educational problems encountered by yet another group of 
students: children from impoverished backgrounds. Unlike the
handicapped children first identified earlier in the century, these students' poor
academic performance was seen as due to the social and economic condi- 
tions in their homes and communities rather than due to organic deficits. 
Therefore, several programs were established by the federal government in 
which poor children were to receive supplemental instruction by specially
trained teachers using innovative techniques before, during, and after 
school. It was hoped that the provision of this enriched educational 
experience would remediate the students' academic difficulties which, in 
turn, would help to free them from a future of poverty. 

In the 1970s, the segregation of low-achieving students was challenged 
by several court cases. The cases established the constitutional right to 
equal access to an appropriate education and upheld the principle of 
integration. The mainstreaming movement is based on the belief that both 
special needs and regular children need to learn to participate side by side in 
a multifaceted, integrated society. Ironically, many of our educational
programs for low-achieving students, although well intentioned, have failed
to promote such integration; in many cases, they have increased separation 
of children by race and achievement levels. The following sections provide a 
brief overview of the history and current structure of both the compensatory 
and special education systems. 



Compensatory Education 

Our current system of compensatory education began as part of President 
Johnson's "War on Poverty." During the 1960s, there was increasing 
concern about the educational prospects of children who came from poor 
families. The attention of educators, politicians, and parents was focused on 
two interrelated concepts: "cumulative deficit" and the "vicious cycle of 
poverty." Essentially, these phrases refer to the belief that children who 
begin school performing behind their peers remain so and fall continuously 
further behind. The belief is that these students are likely to exit from school
at a point when they are still ill-prepared, like their parents before them, to
compete successfully in society. Compensatory education is the major
component in the system of social programs designed to break this cycle by
"compensating" for initial deficits.

Compensatory education provides services to schools with a signifi- 
cant percentage of children whose families are below the poverty line. 
Within qualifying schools, students are then identified to receive
compensatory services based on some measure of achievement (e.g., more than
one-half grade equivalent below grade level). Both the selection of schools
and the administration of programs are based on federal guidelines as
interpreted by state educational agencies. Local educational agencies 
(LEAs) carry out the federal and state directives and generally spend money 
on remedial teachers, instructional aides, and equipment. 

Federally supported compensatory education has its functional bases 
in two legislative acts: the Economic Opportunity Act (EOA) passed in 1964 
(and amendments to it), and the Elementary and Secondary Education Act 
(ESEA) passed in 1965. EOA established Head Start; ESEA established 
Title I and Follow Through programs. With the passage of the Education
Consolidation and Improvement Act (ECIA) in 1981, the programs provided
by ESEA Title I were transferred to Chapter I of the new legislation. 

Head Start 

Head Start, as its name implies, is a preschool program servicing children
from age 3 to school entry. If a family meets low-income requirements, and
if the community has a Head Start program available, then a child can gain
access to Head Start. The fundamental goals of the program are to guarantee
children's physical well-being, to aid their social-emotional development, to
improve their mental processes and skills, to establish patterns of success,
to improve family relationships, to develop social responsibility, and to
increase children's sense of self-worth and dignity (Richmond, Stipek, and
Zigler, 1979; Zigler and Anderson, 1979). 

To answer the often-asked question, "Has Head Start been success- 
ful?" is not easy. Over time, evaluations of the effectiveness of Head Start 
have been mixed; there have been many individual success stories as well as 
many examples of unproductive efforts (Wolff and Stein, 1966). One of the 
impediments to consistent findings is the fact that Head Start is the label of a 
program and not an identifier of a particular instructional (or other) 
treatment. Under the heading of Head Start, many very different instruc- 
tional programs have been implemented, some more successful than others. 
Another problem in evaluations of Head Start has been lack of agreement 
over when and what criterion measures should be adopted. A major finding 
with many of the Head Start programs has been that when participating 
students are tested in kindergarten and first grade, they show few, if any, 
lasting academic gains (McKey, Condelli, Ganson, Barrett, McConkey, and
Plantz, 1985). It has been pointed out, however, that the initial Head Start 
goals also included medical and social features, which have sometimes been 
lost in the storms surrounding this apparent academic "failure." 

The Perry Preschool Program is an example of one of the most
notable and well-documented successes (Barnett, 1985; Berrueta-Clement,
Schweinhart, Barnett, Epstein, and Weikart, 1984; Schweinhart and
Weikart, 1980, 1985; see Chapter 4). In this program, it was precisely the
medical and social goals that were met, and when they were met, they had a 
powerful impact on life chances. Findings from a series of well-designed 
longitudinal studies show that by age nineteen, those students who attended 
the Perry Preschool Program were more likely to have finished high school, 
were dramatically less likely to have committed a crime, were less likely to 
be on welfare, and were earning more money (Barnett, 1985). Convergent
findings have resulted from the research of Lazar and Darlington (1982) in 
their estimates of the reduction of special education assignments which is 
attributable to participation in Head Start. In summary, it is important to 
note that individual programs under Head Start have been successful even if 
the total collection of initiatives demonstrates mixed results.

Over the years, Head Start has maintained its base of financial support
with a current annual budget of approximately one billion dollars (ECIA,
1981; HSAA, 1984; Schweinhart and Weikart, 1985). Head Start serves
more than 400,000 children each year (White and Castro, 1985). 

Follow Through 

In 1967, President Johnson recommended the establishment of a "follow 
through" program that would systematically build upon preschool Head 
Start programs by extending educational support from kindergarten through 
the third grade. Due to budget cuts (from a recommended $200 million to an 
appropriated $15 million), however, a new conception of Follow Through 
surfaced: the so-called planned variation approach (Haney, 1977). In short, 
Follow Through dollars would not be used to extend services to former 
Head Start students but rather would be used to develop demonstration 
projects around the country designed to identify promising instructional 
treatments. 

One important finding of the planned variation studies was that 
program membership per se made little difference in children's learning. For 
example, studies that contrasted Follow Through with non-Follow Through 
students showed little consistent advantage for program participation be- 
cause, like Head Start, the Follow Through programs were quite different 
across sponsors. In addition, the characteristics of students assigned to 
Follow Through programs varied. However, the Follow Through programs 
that engaged in elaborate programmatic descriptions and data collection 
were valuable in developing our understanding of what worked and why 
(Gersten, Carnine, and White, 1984). 

Evaluations were essentially of two types: very large-scale national 
evaluations such as the controversial but often insightful studies conducted 
by Abt Associates (Stebbins, St. Pierre, Proper, Anderson, and Cerva, 
1977), and smaller project-level studies that focused on implementation 
evaluation and the impact of program features (Becker and Gersten, 1982; 
Bereiter and Kurland, 1981-1982; Gersten, 1984; Gersten, Carnine, Zoref, 
and Cronin, 1986; Leinhardt, 1976, 1977). Glossing over many of the 
important methodological and policy issues that were raised by these 
studies, it seems the findings point in similar directions. They tend to suggest 
that of the models available, the most effective were behaviorally 
focused, emphasized learning math and reading, had systems for monitoring 
student progress, provided performance-based feedback, and had mecha- 
nisms that ensured reasonable levels of implementation. The individual 
studies carried out by the program designers and developers or by their 
evaluation staffs found results similar to the national evaluations and smaller 
project-level studies. Students learned more in those classrooms where 
much of the available time was spent on highly directed learning of basic 
skills and in which the progress of individuals or groups of students was 
monitored and in which some type of reinforcement was available (Stebbins 
et al., 1977). In recent years, annual allocations for Follow Through 
programs have decreased substantially. 

Title I/Chapter I 

Title I was launched in 1965 as part of the Elementary and Secondary 
Education Act. It was a massive program that aimed to provide support to 
school districts that were heavily impacted by high numbers of disadvan- 
taged and low-achieving students. Maximum flexibility was given to LEAs 
to develop programs to help educationally deprived children catch up with 
their peers. In 1981, the enactment of Reagan's Education Consolidation 
and Improvement Act modified the way former Title I resources were 
distributed and monitored. Although the educational goal of Title I (hereafter 
called Chapter 1) funds remained the same (i.e., the provision of support 
to states and districts heavily populated with low-income and low-achieving 
students), constraints were put on the ways in which the funds were used. 
For example, there were increased efforts to monitor funding flows in order 
to assure that only Chapter 1 eligible students received services and that 
those services were clearly supplemental to the regular classroom instruc- 
tion (Dougherty, 1985). 

Like Head Start and Follow Through, Chapter 1 programs vary 
considerably from district to district. Evaluations of Chapter 1 programs that 
carefully describe the instructional features of the program as well as the 
academic progress of students have produced the most useful data. For 
example, the Instructional Dimensions Study (IDS) (Cooley and Leinhardt, 
1980) showed that, after adjusting for initial student ability, the time 
available for learning subject matter coupled with the coverage and focus of 
that subject matter (overlap) was more important than any program style or 
instructional organization. In addition, the Sustaining Effects Study (SES) 
(Carter, 1984) also found that the opportunity to learn was very important 
and that other valued strategies (such as grouping and better teacher/student 
ratios) were related to increases in opportunity, although themselves not 
directly influential on student learning. These two studies (IDS and SES) 
and another large study, the Beginning Teacher Evaluation Study (BTES) 
(Fisher, Filby, Marliave, Cahen, Dishaw, Moore, and Berliner, 1978), again 
emphasized the need to look at specific instructional practices rather than at 
programmatic labels. 

Special Education 

Unlike compensatory education, which arose as a response to a perceived 
social and economic problem, special education arose as a response to 
problems that were perceived to reside within the individual. Before the turn 
of the century, such problems were largely ignored by public institutions; a 
handicapped child was seen as an individual and private family concern, not 
as a societal responsibility (Lazerson, 1983). With the influx of a much wider 
range of students into the schools brought on by compulsory education laws, 
these students could not so easily be ignored. Special programs and schools 
were designed and teachers were trained who would be especially sensitive 
and skilled in dealing with children possessing the various handicapping 
conditions. Parents of handicapped children formed strong advocacy groups 
and successfully lobbied Congress for funds to support a wide array of 
categorical programs for the handicapped. It should be noted that, although 
this time period witnessed an upsurge of interest in handicapped children, 
the predominant remedial strategy was to segregate them from their 
nonhandicapped peers and place them into isolated settings that offered 
little or no opportunity for interaction with their nonhandicapped peers. 
As mentioned earlier, the rationale behind this strategy was to protect 
and nurture vulnerable children and to avoid exposing them to ridicule 
and failure. 

The segregated nature of special education was directly challenged by 
what has come to be known as the mainstreaming movement. Mainstreaming 
is based on the principle of the least restrictive environment as expressed 
in court decisions (Pennsylvania Association for Retarded Children [PARC] 
v. Pennsylvania, 1972) and in law (PL 94-142, 1975). Currently, the United 
States is dedicated, in principle, to the maximum integration of all students. 
In fact, as we shall soon see, the present situation reflects the long history of 
special programs still operating quite separately. 

From the outset, special education was presumed to be beneficial. It 
was widely held that an accepting environment, modified instructional goals 
and techniques, and teachers with special training in specific areas of 
exceptionality would lead to more effective programs for handicapped 
individuals. Hence, special education programs initially received little 
systematic scrutiny (Sheehan and Keogh, 1984). Recently, however, a 
number of political, economic, and professional pressures have made 
evaluating and demonstrating program impact a high priority. Unfor- 
tunately, evaluation studies in special education have been plagued by a 
variety of conceptual and methodological problems, some of which parallel 
evaluation research in compensatory and regular education and some of 
which are unique to special education. 

A problem already noted with respect to many compensatory educa- 
tion studies is the failure to adequately specify instructional features 
(independent variables). The special education literature shares this neglect 
of treatment definition and measurement. Although groups of students in 
different types of programs are often compared on various outcome measures, 
the components of these programs typically are not explicated or 
analyzed. 



The failure to provide descriptive, as well as analytical, information 
about instructional features may be related to a second issue in special 
education research, its authorship. Program developers or designers are the 
most frequent evaluators of special education programs. Because they most 
often evaluate their own programs, their most pressing question has been 
"Does it work?" This often results in pre-post designs that leave program 
content unmeasured. Hence, even after significant differences have been 
found in favor of the treatment group, one is still left with the unanswered 
question: "What was it about the program that led to its success?" There 
has been a trend toward the use of "degree of implementation" measures by 
program designers to specify the extent to which the program is being 
implemented as intended, the so-called fidelity approach (Fullan and Pomfret, 
1977). This degree of implementation information is then used to 
attempt to explain various outcome measures and to inform the developer as 
to revisions required to bring the program closer to the ideal. Although a 
step closer to unveiling the "black box" of treatment variables, degree of 
implementation measures, it has been argued, often fail to adequately 
unmask program features (Leinhardt, 1980). 

Inadequate specification and imprecise measurement have also plagued 
dependent variables in special education research. As noted earlier, the 
most frequently stated goal or desired outcome for special education is to 
help each individual child reach his or her maximum potential, a goal shared 
by regular education. Although philosophically sound, this goal cannot be 
easily operationalized. Traditionally, the IQ has been the most common 
outcome measure for early intervention and the standard achievement test 
for special education. The inadequacies of these measures for special 
education populations, however, have been well documented. They include: 
the lack of growth norms for handicapped learners, possible cultural bias of 
item content, and little overlap between the content of special education 
curriculum and the content of test items (MacMillan, Keogh, and Jones, 
1986). Recently, there has been a call for broader definitions of outcome 
variables in special education research (e.g., the social goals that were 
achieved in the Perry Preschool Project: high school graduation, success in 
later life, etc.). It must be recognized, however, that such reconceptualization 
of outcome domains carries with it new demands for the development of 
adequate and sound designs and instrumentation. 

Finally, interpretation of the results of special education research is 
limited because of the variability over time in definition of various mildly 
handicapped categories and programs. This limitation on interpretation of 
findings stems from both historical and present-day practices. Historically, 
the changing criteria for membership in the various mildly handicapping 
categories have led to the unfortunate situation where the results of research 
are valid only for the period of time for which the definition parameters for a 
category remained unchanged. Results from present-day studies also are 
questionable due to inconsistencies about both definitional and placement 
criteria for subject selection. For example, many studies use system- 
identified children, but criteria for access to various special education 
programs can vary from district to district. A child classified as LD in one 
district may possess different characteristics from a child classified as LD in 
another district (see Ysseldyke and Algozzine, 1983). 

Given the many limitations of special education research, findings on 
effective practices must be treated with caution. Nevertheless, some find- 
ings seem strong and robust enough to appear with a fair amount of 
consistency. Furthermore, those findings that are consistent seem to emerge 
from studies that are focused on the details of instructional practice as 
opposed to more global programmatic variables. 

One such set of effective practices can be grouped under the label of 
teacher-directed instruction, also referred to as direct instruction (Rosen- 
shine, 1983). This refers to instruction that is planned and presented by the 
teacher and that can be characterized as systematic and academically 
focused. Concepts are presented in sequential order of difficulty and are 
related to what students already know. Teachers model correct performance, 
closely monitor student performance, and provide corrective feedback 
during recitation, drill, practice, or application activities. Active 
academic responding by students is encouraged. Recently, direct instruction 
has been scrutinized regarding its efficacy in promoting higher-order 
thinking. Although direct instruction approaches tend to be associated with 
high student scores on achievement tests, it has been suggested that direct 
instruction may not be the best method for encouraging the development of 
critical thinking and problem-solving (Doyle, 1983). We discuss this point in 
depth in the final section of this chapter. 

A related set of instructional practices was found to be effective in a 
recent comprehensive review of studies on academic interventions with LD 
students. They include instruction that is (1) goal-directed (has a close 
correspondence with terminal objectives); (2) practice-loaded; (3) strategy-laden 
(teaches students the processes for performing academic tasks); 
(4) independence-oriented (provides a great deal of teacher direction in 
initial stages followed by increasing amounts of student self-direction); and 
(5) detailed and comprehensive (Lloyd, 1988). It should be noted that the 
above characterizations of effective instruction with special education students 
are not setting-bound. That is, no evidence supports the notion that a specific 
practice works only in a specific special education program setting. 



DESCRIPTION OF CURRENT PRACTICE 

As described in the previous section, the various preventive and remedial 
programs falling under the rubric of special and compensatory education 
were initiated independently of one another as well as from the mainstream 
of regular education. Their separate roots are still very evident in the 
mutually exclusive funding, entitlement, and administrative procedures that 
govern current practice. Services to compensatory education students and 
the mildly handicapped students are regulated by two very separate administrative 
bureaucracies and are typically delivered without significant interprogram 
coordination. 

Mildly handicapped special education students generally receive their 
education in one of three settings: (1) self-contained classrooms from which 
they are mainstreamed to some extent, usually for nonacademic classes; (2) 
resource rooms in which they receive special help for reading and math 
(students are "pulled out" of their regular classroom to attend resource room 
instruction); or (3) regular room placement with a resource teacher who 
helps occasionally or daily to see that the targeted student is keeping pace. 
In self-contained classes there are between twelve and fifteen students with 
one specially certified teacher and, depending on the disability, one aide. 
Almost 70 percent of the school day is devoted to reading and little or no 
time is spent on social studies or science (Leinhardt, Zigmond, and Cooley, 
1981; Norman and Zigmond, 1980). In resource rooms there are usually six 
to twelve students who go there for one or two periods a day (less than 30 
percent). Both settings tend to emphasize behaviorist approaches to instruction 
and assessment. That is, specific objectives are set and students work 
(often using worksheets) to master limited and subsequently tested skills. 
Global, integrative work is uncommon. 

Compensatory education students are either in regular classes with a 
teacher and an aide or with a teacher and a part-time specialist teacher, or 
they attend pullout classes, working with a specialist teacher (see Chapters 1 
and 7). Significant percentages of both reading and math instruction are 
typically given in pullout rooms. Class size for pullout rooms is variable and, 
as with special education, the most successful and predominant style of 
instruction is behaviorally based. 

Disadvantages of the Pullout Method 

The pullout or resource room approach is currently the predominant method 
used to deliver educational services for low-achieving students (Carter, 
1984). However, this method has not been shown to have positive effects on 
student learning. In fact, pullout programs too often result in a disjointed 
educational experience for the very children who are most in need of the 
best possible educational experience. In the following discussion, we 
consider five issues related to the delivery of instructional services using 
the pullout technique: (1) instructional fragmentation, (2) time erosion, 
(3) oversight responsibility for the individual student, (4) ownership of 
educational services, and (5) procedural requirements. 



Foremost among problems associated with the pullout model is lack of 
attention to instructional coordination between what is taught in the regular 
classroom and what is taught in the pullout room. From the students' 
perspective, they may feel as if they are on a roller coaster of special 
assignments. Content may be duplicated or not covered at all, depending on 
the configuration and timing of services received. It is not unusual, for 
instance, for a child to receive two math classes and no science. Within a 
particular subject (e.g., math), it becomes the individual student's responsibility 
to make sense of the all-too-often disparate learning tasks that are 
assigned under the auspices of these two very different classroom situations. 
In order to gain a true understanding of a particular domain, the students 
must, on their own, integrate fragmentary instructional content into a 
meaningful and internally consistent conceptualization. Such fragmentation 
of instruction could be expected to pose problems for any student; for 
low-achieving students it is particularly devastating. (See Chapter 11 for an 
excellent discussion of problems in instructional coordination between 
regular and pullout classes.) 

From the teacher's perspective, multiple pullouts result in a managerial 
headache. It is not unusual for a regular classroom teacher's daily instruc- 
tion to be almost continuously interrupted by students leaving for (or coming 
back from) Chapter I instruction, bilingual instruction, learning disability 
resource rooms, and so on. Such pullouts often happen in odd patterns 
throughout the school day. In cases where a single student has been declared 
eligible for multiple programs (e.g., Chapter I, learning disabilities, speech 
therapy), there may be insufficient hours in the school day to remove the 
student for all of the extra assistance to which he or she is entitled (Carter, 
1984). It is ironic that one of the instructional features consistently found to 
be related to increased student learning, time spent actively engaged with 
criterion (or tested) material, is exactly the feature that is undermined when 
low-achieving students spend unproductive time traveling to and from 
special assignments. 

The lack of coordination of instructional content mirrors the more 
general problem of lack of responsibility for the individual child. Typically, 
homeroom teachers accept an oversight responsibility for regular elementary 
school students. Homeroom teachers are likely to be in touch with the 
attendance patterns, emotional profiles, and typical achievement levels of 
the children who report to them first in the morning and last in the afternoon. 
Furthermore, elementary students often spend much of the school day 
receiving instruction from their homeroom teacher. Which professional 
oversees the child who spends significant chunks of the school day in two to 
three or more different places? It has been suggested that regular teachers 
often "tolerate" but show little sustained interest in the overall welfare of 
such students (e.g., Cohen, Intilli, and Robbins, 1978). In some ways, 
resource room assignment can be likened to departmentalization, which 
Slavin (1987) suggests runs the risk of diffusing responsibility for individual 
children. An unforeseen consequence may be that the child fails to form a 
bond or identification with a single caring adult. 

A closely related concern is the lack of local ownership over the 
educational services provided to children with special learning needs. Most 
personnel who run special programs in elementary school buildings are 
directly accountable to Chapter 1 supervisors, district special education 
directors, or other similarly "removed" supervisors. This represents a 
concomitant loss of control at the level of the building principal, another 
irony given findings from the school effectiveness literature on the importance 
of building-level leadership. Increasingly, empowerment has been 
stripped from building principals as special programs have become characterized 
by a narrowing of leadership (Heller, Holtzman, and Messick, 1982). 

This program-specific, often geographically distant, leadership can be 
seen as related to the tremendous increase in the procedural requirements 
placed upon teachers and school administrators. The requirements include 
preparation of individual educational plans (IEPs), scheduling of meetings, 
and other accountability procedures necessary to ensure dollar flow. Such 
procedures are time consuming and often divert valuable resources from 
attention to more substantive aspects of the education of low-achieving 
children. It has also been suggested that a litigious atmosphere is created by 
an overemphasis on procedural requirements which, in turn, heightens 
distrust between teachers and parents (Reynolds and Wang, 1981). 

This close procedural monitoring may also strain relationships between 
regular and remedial teachers. For example, federal guidelines clearly 
specify that Chapter 1 monies only be used to purchase material or hire 
personnel to assist Chapter 1 eligible students. Any spillover of benefits to 
regular students is prohibited. Strictly speaking, a Chapter 1 aide working in 
a regular classroom may not respond to a request for help by a non-Chapter 
1 eligible student. Similarly, books bought with Chapter I monies must be 
clearly marked as such and not shared with other students. Clearly, the spirit 
of the law has been subverted by overly rigid procedures aimed at monitor- 
ing adherence to the letter of the law. 



Questions about the Instructional Efficacy 
of Separate Systems 

Although there is widespread recognition of the problems associated with 
the pullout approach, many educators argue that improvements can be made 
without a complete overhaul of the overarching structure of educational 
programming for low-achieving students. Indeed, the stakes for maintaining 
the present separate systems are perceived to be quite high by many 
individuals who are involved in what Reynolds and Lakin (1987) have 
termed the "collateral enterprises of teaching, teacher supervision, child 
assessment, teacher training, advocacy, product development, research and 
scholarly writing, and professional organization." These collateral enter- 
prises have developed hand in hand with, in particular, the various special 
education classifications (e.g., learning disabilities, mental retardation) and 
are often perceived to be dependent on the continuation of a system that is 
built around various categorical procedures. 

Arguments against retaining the present system, on the other hand, 
hinge on evidence that children assigned to resource, pullout, or self-contained 
settings are not receiving qualitatively better instruction. This 
evidence comes from several fronts. In the following discussion we will 
briefly summarize selected findings from three lines of work that cast doubt 
on the instructional efficacy of continuation of the present structure of 
separate educational systems for low-achieving children. They are: (1) 
comparative studies of the opportunity to learn in remedial settings versus 
regular classrooms, (2) work in the area of special education classification, 
and (3) the growing knowledge base on effective instructional practices that 
are not classification- or setting-bound. 

Opportunity to Learn 
Comparative studies have suggested that placement in Chapter I or special 
education settings does not guarantee increased opportunity to learn academic 
material. For example, data gathered through intensive observations 
of nearly 100 Chapter I and non-Chapter I fourth-grade students from four 
different schools suggest that the assumption that placement in a Chapter I 
setting will provide more structure and opportunity to engage in reading and 
math may be incorrect (Stanley and Greenwood, 1983). Although different 
structural contexts are often provided in Chapter 1 settings, the effect on 
students' opportunity to learn may not be straightforward. For example, 
Stanley and Greenwood's data show that Chapter I students were engaged 
an average of eleven minutes less per day than students in non-Chapter I 
settings. Similarly, results from a series of studies conducted at the Institute 
for Research on Learning Disabilities at the University of Minnesota suggest 
that placement in special education settings is not necessarily associated 
with increased amounts of student engaged time (Thurlow, Graden, 
Greener, and Ysseldyke, 1982; Thurlow, Ysseldyke, Graden, Greener, and 
Mecklenburg, 1982; Ysseldyke, Thurlow, Mecklenburg, Graden, and 
Algozzine, 1984). For example, Ysseldyke et al. (1984) examined change in 
level of service from pre- to postplacement. Although their findings must be 
treated cautiously due to the small sample size, the results of their 
observations indicated that the students did not receive more intensive 
services in special education than they received in the regular education 
environment. Instead, it was found that amounts of academic instruction 
and responding time actually decreased two months after assignment to 
special education. 

Several characteristics of remedial instruction may be seen as related 
to the depressed levels of academically engaged time often found in pullout 
programs. At the most obvious level, less instructional time can be seen as 
partly a result of time spent moving from class to class and getting down to 
work once arriving at the various settings. In addition, students in special 
settings may be spending considerable time in academically unproductive 
process training (Arter and Jenkins, 1979). Time spent on perceptual, social, 
and motor skills training represents a corresponding loss of time spent on 
academic skills. Finally, it has been suggested that the pace of instruction for 
low-ability students is slower and that teachers focus more on low-level 
objectives and routine procedures than they do with high-ability students 
(Shavelson and Stern, 1981; Weinstein, 1987). The overall effect of the 
above characteristics of instruction is less opportunity to learn what is 
subsequently valued and tested throughout the students' schooling careers. 
Furthermore, the problem of instructional fragmentation mentioned pre- 
viously only compounds the effects of lack of opportunity to learn. When 
students are engaged in academic work, the content may seem unintegrated 
and meaningless. This experience stems from the use of different materials 
and methods in the different settings and/or from the lack of communication 
between the different teachers about coordinating the instructional programs. 

Classification Issues 
Historically, the classification of special needs children into specific categories 
(e.g., LD, EMR) has served several functions. One of the more positive 
functions of the classification system has been the utilization of the various 
categorical labels as unifying symbols around which public and legal support 
has coalesced. Parents and other concerned parties have formed powerful 
advocacy groups for each of the categorical disabilities and have been 
successful in lobbying Congress for fiscal support for a variety of social and 
educational programs. 

In addition to their public support function, categorical labels have 
been viewed as possessing a diagnostic and prescriptive function. Until 
recently, it was assumed that the various categorical disabilities accurately 
reflected different handicapping deficits, stemming from different root 
causes, and capable of being accurately diagnosed through standardized 
psychometric assessment procedures. Furthermore, once diagnosed, it was 
believed that treatments could then be prescribed or tailored to the specific 
"illness" that had been uncovered. These treatments would then be 
administered by professionals who were specially trained in the technical 
knowledge about the causes and cures of the problem. 

The above model of classification and treatment has come under attack 
in recent years. For example, Reynolds and Lakin (1987) have argued that a 
choice of a classification system should be based on the purpose(s) it is 
meant to serve. Within education, the purpose is to effectively instruct 
children. However, historically classification procedures and systems of 
categorization have evolved from such nonteaching disciplines as psychology 
and medicine. Increasingly these systems and procedures have been 
recognized to have poor applicability to the purposes of instruction. There is 
little evidence to suggest that current procedures (i.e., assessing the child 
with a standardized instrument and then identifying him or her as LD, EMR, 
etc.) provide data relevant to decisions regarding the type of instructional 
program needed to improve the academic performance of special needs 
students (Jenkins and Pany, 1978). Nevertheless, enormous amounts of 
energy are funnelled into diagnosis and placement procedures under the 
current system (Ysseldyke and Algozzine, 1983). Such a focus diverts 
attention from more substantive issues, such as identification of the areas of 
knowledge and skill that low-achieving students lack. For a more thorough 
discussion of problems associated with current classification procedures, the 
reader is referred to Hobbs' (1975) major work in this area. Based on his 
findings, Hobbs has concluded that current classification and placement 
procedures used to assign students to special classes are "a major barrier to 
the efficient and effective delivery of services to them and their families" 
(Hobbs, 1975, p. 1274). 

Knowledge Base on Effective Instruction 
for Low-Achieving Students 
There is a growing consensus on the features of instruction which successfully 
move the lower quartile of students closer to the mean. We have briefly 
touched upon some of these in the first part of the chapter; a more in-depth 
discussion and analysis of effective instruction for at-risk students is 
presented in the final section. At this point, it is important to note that a 
significant attribute of these instructional features is that they tend not to 
vary across programs or settings, and features which are found to be 
effective are effective in both compensatory and special education contexts. 
The presence of these effective instructional practices that do not seem to be 
tied to specific types of low-achieving children or to specific settings is a 
powerful argument that challenges the instructional rationale for separate 
systems. 

In summary, the arguments against separate programs include (1) the 
decreased opportunity to learn in special settings; (2) the lack of specific 
instructional matches for traditional diagnostic labels; and (3) the existence 
of effective instructional features across settings and types of handicap. It 
has been our position that the weight of these arguments places the burden 
of proof on those individuals who advocate continuation of separate 
instructional programs for students at risk. The evidence presented above, 
we feel, provides serious challenges to the instructional efficacy of separate 
educational systems for low-achieving students. Considered in tandem with 
the disadvantages associated with "pullout" programs, a strong case 
emerges for serious consideration of a more unified instructional system. 
Such a system would merge the instruction of all types of low-achieving 
students and also, to the greatest extent possible, integrate instruction for 
low-achieving students with the mainstream educational program. Moreover, 
the existence of a set of instructional features which are effective 
across types of students and educational settings provides a foundation upon 
which to begin to assemble the necessary components of such a merged 
instructional program. The guiding principle of the program would be the 
targeting of educational treatments based on the type of instructional 
support needed, thereby labeling instruction and not children. 



INSTRUCTIONAL CONSIDERATIONS 
UNDERLYING A MERGED SYSTEM 

In this final section, a brief rationale for an instruction-driven approach to 
the development of a merged system is presented followed by a discussion of 
the current state of knowledge on effective instruction for at-risk students. 
The section concludes with an overview of promising agendas for continued 
development of the knowledge base on effective instruction for low- 
achieving students. 

An Instruction-Driven Approach: 
Putting the Horse Before the Cart 

Previous attempts to improve programming for special education and 
compensatory education students have often focused on noninstructional 
variables. Many efforts have emphasized the physical or temporal integra- 
tion of low-achieving children without concomitant or prior attention to 
instructional integration. In other words, improvement efforts have often 
been defined in terms of the setting in which at-risk students are instructed 
and the amount of time they spend in various settings. For example, Epps 
and Tindal (1987) found that one of the properties of mainstreaming 
programs that is most often examined in efficacy studies is temporal 
integration, referring to the amount of time students spend in regular 
classrooms with their nonhandicapped peers. 

Throughout history, much attention has been devoted to the question 
of the most appropriate setting in which to educate low-achieving students. 
Much of the early work in this area (pre-1970) hypothesized that separate 
classes were more effective (Heller, Holtzman, and Messick, 1982). Since 
the late 1960s and early 1970s, however, the majority of investigations 
have hypothesized that the mainstream is the most efficacious placement 
(Madden and Slavin, 1983; Polloway, 1984). The focus throughout, 
however, has been more on administrative arrangements than on instructional 
variables. Building on past work (Leinhardt, Bickel, and Pallay, 1982; 
Leinhardt and Pallay, 1982) and relevant recent work (Epps and Tindal, 
1987), we take the position that debates that focus on the best setting (e.g., 
resource room versus self-contained classroom, etc.) in which to educate 
low-achieving students miss the mark. It is our belief that setting, because it 
is an easily understood, highly visible, and manipulable variable, has been 
overplayed by policymakers, researchers, parents, and teachers. 

Furthermore, findings from studies that employed setting or time spent 
with nonhandicapped peers as the major independent variable have been 
inconsistent. Neither self-contained classrooms, resource rooms, nor regu- 
lar classroom placement has been unequivocally supported as the most 
effective treatment. Many investigators (Epps and Tindal, 1987; Leinhardt 
and Pallay, 1982) have interpreted these findings as evidence that setting, as 
a global construct, is an inappropriate variable for efficacy studies. In fact, 
setting may often mask more significant, fine-grained variables such as the 
features of instruction that lead to positive student learning. It is our belief 
that educational setting is not the salient variable that determines the 
success of instruction. Rather, certain features of educational treatments 
affect student outcomes. 

For the remainder of this chapter, we ask the reader to put not only 
setting questions, but also other programmatic and policy questions on hold, 
not because they are unimportant, but rather because to address them a 
priori is putting the cart before the horse. Instead, we would like to discuss a 
vision of a merged system that is built on evidence of effective instruction 
for low-achieving children. Using effective instructional practices as the 
foundation, issues of setting, timing, and policy can subsequently be 
addressed. 



The Instructional Knowledge Base 

Knowledge regarding what constitutes effective instruction for 
low-achieving students comes from diverse sources. First, a summary of the 
findings from studies of special education and compensatory education 
described earlier in this chapter will be presented, followed by a brief 
overview of recent work in the area of effective instruction for low-achieving 
students. Subsequently, we discuss a second line of work, research in the 
area of cognitive strategies, and highlight its implications for the instruction 
of at-risk students. Analyses of these two lines of work reveal substantial 
progress as well as several nagging concerns. The concerns, we shall argue, 
seem to revolve around insufficient attention to the content of instruction. 
The chapter concludes with a discussion of the potential of a third line of 
work, which is informed by serious consideration of both what is taught (the 
content of instruction) and how it is taught (the instructional approach). 

Instruction That Works with Low-Achieving Students 
Much of the initial knowledge concerning effective instruction for low- 
achieving students was gained by the evaluations of compensatory and 
special education programs described earlier in this chapter. Briefly, most of 
these evaluations have found that interventions that are behaviorally 
focused and spend the majority of time on highly directed, teacher-led 
instruction in basic skills in reading and math are effective in improving the 
achievement of at-risk students (Becker and Gersten, 1982; Bereiter and 
Kurland, 1981-1982; Gersten, 1984; Gersten, Carnine, Zoref, and Cronin, 
1986; Leinhardt, 1976, 1977). Other components of successful programs 
included frequent monitoring of student work, and the provision of 
performance-based feedback and appropriate reinforcement (Stebbins et al., 
1977). Evaluations of Chapter I programs have also identified the quantity of 
time spent on instruction as a potent indicator of student achievement. More 
specifically, the opportunity to learn material that is subsequently tested has 
been shown to correlate with student mastery of basic skills (Cooley and 
Leinhardt, 1980). As noted earlier, these studies of special and 
compensatory educational practices have provided very similar pictures of effective 
instruction regardless of programmatic or student classification labels. When 
grouped under the label of direct instruction (Rosenshine, 1983), the 
components are often summarized as systematic academic instruction 
planned and presented at least in part by the teacher, along with active 
academic responding by the students and modeling, monitoring, and feed- 
back by and from the teacher. The main characteristic is focus and a lack of 
meandering around in the content under study. 

Other more recent approaches to the identification of effective instructional 
techniques have aimed to identify features of instruction that are 
particularly successful for low-achieving students who are either mainstreamed 
or taught within Chapter I classes. Three efforts in this regard are 
reported on in this book. Larrivee (Chapter 10) has extended the 
process-product methodology frequently used in the effective teaching literature in 
general education to the identification of features of instruction that can aid 
low-achieving mainstreamed students. Her findings closely parallel the 
process-product findings from the regular education literature. In particular, 
effective teachers asked factual questions that were most often answered 
correctly by students, and gave positive feedback to correct answers as well 
as supportive, clarifying feedback to incorrect answers. They rarely 
criticized, were very responsive to students, and held high expectations. 
Interventions were supportive, student contact was teacher-initiated, and 
on-task behavior was reinforced. Finally, students worked at their own 
instructional levels and were appropriately engaged in learning activities; 
disciplinary actions were seldom needed. 

Crawford (Chapter 9) has pulled together information from diverse 
sources (Brophy, 1986; Cooley and Leinhardt, 1980; Crawford, 1983; 
Griswold, Cotton, and Hansen, 1985) and has identified consistent themes 
regarding school, teacher, and student variables that impact the achievement 
of educationally disadvantaged students. His synthesis identifies the 
following instructional activities as conducive to achievement gains in 
Chapter I classes: active, teacher-led instruction; busy monitoring of 
student learning; high expectations communicated by relatively difficult 
seatwork assignments, challenging verbal questioning, and a focus on 
academic content; recognition of positive student performance; one-to-one 
interactions (primarily in primary grades reading); content coverage and 
maintenance of a brisk pace (in math); encouragement of student involvement; 
and efficient management of nonacademic classroom activities. As 
with Larrivee's work, the above effective teaching behaviors have been 
identified using process-product methods. Although many of Crawford's 
findings agree with the results of process-product work done in regular 
classrooms, some are not congruent. Crawford argues that more studies are 
needed that examine the effect of setting or context on effective instruction 
with low-achieving students. 

Finally, the approach used by Slavin and Madden (Chapter 2) has 
involved systematic analysis of in-place classroom programs that integrate 
at-risk students into the regular educational program. They first identified 
comprehensive classroom programs that were effective in increasing the 
achievement levels of at-risk students. The two categories of programs that 
were most effective were "continuous-progress" models and cooperative 
learning models. Subsequently they extracted the features or principles that 
are shared by continuous-progress and cooperative learning programs. 
These features include instruction directed to individual students' needs and 
provided in small, homogeneous groups; instruction from the teacher, not 
from written materials or from peers; constant assessment of students' 
progress accompanied by specific procedures to remediate any small problems 
before they become large ones; and the existence of a structured 
hierarchy of skills that students are expected to master. 

The above characterizations of effective instruction for low-achieving 
students have several features in common. First, the teacher emerges as a 
central figure. Currently, the best-developed materials or technological tools 
are no match for a competent teacher who carries the material directly to the 
student. Second, effective instruction is clearly tied to academic content. 
Training in perception, motor skills, social skills, and other indirect activities 
does not promote development of school-based cognitive skills. Third, 
students must spend sufficient amounts of time being taught and practicing 
academic skills. Mastery does not materialize from brief encounters, but 
rather develops with academically engaged time spent on material that is at 
an appropriate level of difficulty and that is subsequently tested. Finally, 
monitoring, feedback, and high expectations are significant elements in 
increasing student learning. It seems noteworthy that a common set of core 
features has emerged from such a diverse group of studies. Moreover, the 
shared features are uniform across settings (i.e., studies of what is effective in 
mainstreamed settings reveal similar features as the studies conducted in 
compensatory and special education settings). 

Nagging Concerns 
Although the preceding work provides an overall sense of optimism, several 
concerns remain. One is the growing acknowledgment that the effects of 
direct instruction have not been documented with respect to the acquisition 
of higher-order cognitive skills. A second concern relates to the difficulty of 
developing guidelines for improved practice using some of the current 
findings (e.g., increasing time-on-task, monitoring student practice, giving 
performance-based feedback, setting high expectations, etc.). Each of these 
concerns is discussed below. 

Direct Instruction and Higher-Order Cognitive Skills. The power of 
direct instruction to facilitate the learning of lower-level skills such as basic 
facts has been firmly established. Not only do direct instructional techniques 
assure that teachers are teaching what is supposed to be taught, but also the 
segmentation of the to-be-learned material into manageable pieces along 
with active responding by students seems to be especially effective with low 
achievers. Nevertheless, concerns are growing regarding the efficacy of 
direct instruction with regard to the learning of higher-level cognitive skills 
such as problem-solving and reading comprehension (Peterson, 1986). This 
concern has been fueled by recent reports regarding the inability of students 
in general to engage in these high-level comprehension and problem-solving 
activities. Recent results of the National Assessment of Educational Progress 
(NAEP), for example, have indicated that, although students are mastering 
the "basic" computational and decoding skills of mathematics and reading, 
they are failing to attain more abstract levels of competencies such as math 
problem solving (Carpenter, Corbitt, Kepner, Lindquist, and Reys, 1981). 
Similarly, declines in scores on reading comprehension tests have been 
documented beginning in the 1960s (Chall, 1983). The lack of these skills 
among low achievers has been viewed as especially worrisome because 
teachers often provide little exposure to such higher-level skills to this 
population. Despite evidence regarding the importance of high expectations 
for low-achieving students, the objectives of many "low" reading and math 
groups remain at the "mastery-of-facts" level. For example, it has been 
documented that whereas students in high reading groups come away 
understanding that the purpose of reading is comprehension, students in low 
reading groups often come away thinking that the sole purpose of reading is 
to read individual words accurately (Allington, 1983). 

It has been suggested that direct instructional techniques are not suited 
to the development of higher-order cognitive skills (Doyle, 1983). However, 
several factors may be clouding the issue. First, measurement of higher-order 
understanding and cognitive skills is more elusive than assessment of 
basic facts. Although the psychometrics underlying standardized achievement 
tests (mostly tests of basic skills and facts) have become more and 
more refined, the validity and reliability of measures of problem-solving and 
other higher-order cognitive skills are still debated. Therefore, it may be 
unfair to indict direct instruction (or any other instructional approach) as 
inadequate for the teaching of higher-order skills. Second, the majority of 
direct instruction has focused on lower-level skills. It seems unreasonable to 
expect that instruction in decoding or pronunciation will increase students' 
text comprehension, or that instruction in addition facts will increase 
students' ability to solve word problems. For example, increases in student 
engaged time during reading lessons with an emphasis on accuracy have 
been associated with increased scores on measures of lower-order skill 
attainment, but not on measures of higher-order skills (Anderson, Mason, 
and Shirley, 1984). These findings might be interpreted as supporting the 
notion that direct instruction is not appropriate for the development of 
higher-order skills. A competing interpretation, however, is that the 
students simply learned what they were taught. No doubt, quantity of time 
will not make a difference if what is being taught is not what is subsequently 
tested. We have referred to this phenomenon elsewhere (LeMahieu and 
Leinhardt, 1985; Leinhardt and Seewald, 1981) as overlap. 

Rather than question the overall appropriateness of direct instruction, 
it seems we need to begin to address the question, "Direct instruction of 
what?" We need to recast our notion of direct instruction from that of a 
comprehensive instructional prescription to that of an instructional 
approach for teaching material the nature of which must be specified. Logically, 
one could talk about using a direct instructional approach for the 
teaching of problem-solving or other higher-order cognitive activities such 
as writing an interpretive essay. The practical problem is that we are not 
adequately knowledgeable regarding what comprises expertise in these more 
complex domains. Researchers are beginning to make some progress toward 
identifying the component knowledge and skills that comprise expert 
performance in higher-order knowledge and skill areas. These advances will 
have important implications for instruction in more complex domains. We 
will review promising developments and emerging research agendas in this 
area later in this chapter. 

Limitations of "Generic" Instruction. A second concern with respect 
to the current evidence regarding effective instruction for low-achieving 
students is what can be referred to as the "vacuous" nature of 
many of the teaching principles derived from process-product and other 
closely related kinds of research programs. Research carried out in the 
process-product tradition attempts to correlate gains in student achievement 
with certain teaching behaviors, such as the types of questions asked and the 
amount of feedback given. This research, by and large, has focused on the 
general or "generic" teaching behaviors shared by effective teachers. The 
attempt has been to span many teachers and teaching situations with a focus 
on global behaviors in order to allow for replication and generalizability of 
the findings. This strategy has had a measure of success in that the 
process-product paradigm has generated many findings that have been 
replicated across studies, whereas other earlier research paradigms had not. 

Summaries of studies using the process-product methodology often 
conclude with a "composite" of an effective teacher. Both Larrivee's and 
Crawford's summaries of findings (Chapters 9 and 10) are examples of 
composite summaries of effective teachers (i.e., the effective teacher asks 
factual questions, rarely criticizes, holds high expectations, etc.). Such 
summaries are attractive to policymakers and teacher trainers because they 
(1) often intuitively "make sense" and (2) can be put into the form of 
propositions which can then be advocated as prescriptions for classroom 
practice (e.g., teachers should provide sufficient wait time during oral 
questioning of students; teachers should use ordered turns in reading 
groups). Recently, however, the efficacy of this approach as a guide for 
practice has been questioned (Shulman, 1987). The questions revolve 
around the way in which composite summaries are produced, including the 
nature of the variables that make up composite summaries of effective 
teachers. Composite summaries are drawn from data collected on general, 
yet isolated variables that have been aggregated across several kinds of 
teachers and teaching situations. Researchers interpret and integrate these 
findings when they produce composite summaries of effective teachers. It is 
important to realize that the composites are not descriptions of actual 
teachers who have been observed by the researchers. Rather, they are the 
product of researchers who "pull together" findings from a variety of 
teachers. When guidelines for actual classroom practice are inferred from 
composite summaries, they often seem incomplete. What is lacking is 
information on when and how to apply the various principles or proposi- 
tions. 

A major limitation, then, is the lack of attention to the specific 
contextual factors that give texture and real-world (face) validity to the 
findings. Without more fine-grained attention to how these findings can be 
applied in specific contexts, their effect on practice becomes diluted. It has 
been suggested that some of these findings can perhaps more reasonably be 
viewed as indicators of effective teaching as opposed to a descriptive 
portrait of what constitutes effective teaching. For example, Shulman (1987) 
has argued that the reasons behind these findings ought to be analyzed and 
used to guide practice. In fact, investigators have already begun to look 
"beyond" these initial findings, moving toward more fine-grained descriptions 
of the thoughts and actions of effective teachers and their students. 

The intensive work on the variable of time provides an excellent 
example of the evolution of a variable from a process-product-generated 
indicator of effective teaching to a more descriptive and in-depth characterization 
of what good teaching entails. Time is an important schooling 
variable. Its value relates to the fact that it is a finite commodity and that it is 
directly related to the opportunity to learn. The opportunity to learn, in turn, 
has been found to influence the amount that is learned (Carter, 1984; Fisher 
et al., 1978). Initially, the "discovery" of the importance of time was 
heralded with much optimism because time is a highly visible and manipulable 
construct. More than the earlier variables that consumed researchers' 
attention (e.g., SES), time was a variable over which teachers and principals 
(and legislators) felt they could exert some control. 

It was not unusual for very early work in this area to make extravagant 
claims about the effects of time. For example, after reanalyzing a subset of 
the Coleman, Campbell, Hobson, McPartland, Mood, Weinfeld, and York 
(1966) data, Wiley and Harnischfeger (1974) concluded that simply increasing 
exposure to instruction (defined in terms of student attendance × 
number of hours per day × number of school days per year) would lead to 
increased student achievement. This initial enthusiasm was soon tempered, 
however. In a replication of Wiley and Harnischfeger's analyses using 
additional data sets, Karweit (1976) found that the effects of time were not so 
straightforward. Karweit (1977) went on to argue that previous analyses of 
the relationship between quantity of time and student achievement were 
incomplete, neglecting, in particular, attention to how time is spent. Soon 
afterward, Gage (1978) criticized continued investigations into engaged time 
with the observation that time is an empty vessel. 

A new wave of research followed that sought to fill the "empty vessel" 
of time with more descriptive accounts of what was happening in classrooms 
and in the minds of teachers and students. Studies of time-on-task began to 
ask, "Time on what task?" In other words, the research community began 
to acknowledge that the amount of time engaged was not as important as 
how that time was spent. This represented a major shift in how time was 
treated in investigations of school learning. From an early focus on time as a 
variable in its own right, attention shifted to the use of time as a method of 
scaling information about other potentially important classroom events 
(Leinhardt, 1985). 

This brief history of research on the variable of time is presented in 
order to highlight the crucial role played by those early inquiries. It is 
important to note that, although initial work in the area of time is now 
criticized as vacuous, it did provide a marker or a signpost that pointed 
researchers to an important area of classroom life heretofore neglected. 
Subsequently, researchers began to "unpack" this variable with 
ever-increasing, fine-grained analyses into what constitutes the effective use of 
time and why time spent is so important. 

Most importantly, the evolution of research on time has had important 
implications for guiding the instruction of at-risk students. For example, we 
have already learned that time should be spent on material that we want 
children to learn. That is, students learn more when they are spending their 
time on material to be learned rather than on activities designed to teach 
them in a more indirect way (Carter, 1984; Cooley and Leinhardt, 1980; 
Gersten, 1984; Leinhardt and Seewald, 1981; Leinhardt, Zigmond, and 
Cooley, 1981; Rosenshine, 1978). Moreover, continued attention to how 
time is spent and why time is important will most likely provide an even 
more precise understanding of how to guide instruction for low-achieving 
students. For instance, in the future, we may learn that time has different 
implications for lower-order versus higher-order learning. Perhaps rapid 
pacing is conducive to the learning and maintenance of lower-order knowl- 
edge such as addition or subtraction facts, but not as appropriate for 
instruction on more complex topics such as problem solving in mathematics 
or other higher-order concepts in science or history. 

In addition to the variable of time, the work to date on effective 
instruction for at-risk students has provided us with many other indicators of 
effective teaching. Reviewed earlier, they include close monitoring of 
student performance, consistent feedback, and high and clear expectations. 
With these findings (and others) serving as signposts, researchers are now 
primed for more in-depth analyses of the hows, whens, and whys underlying 
these indicators of effective teaching. 

Recent work suggests that advances in this area will be tightly 
intertwined with inquiries into how teaching and learning occur in various 
subject matter domains (Shulman, 1986). In other words, how teachers 
organize and deliver the content of instruction is seen as an important, but 
neglected aspect of effective teaching. For example, in order to provide 
effective guides for practice, we will need studies that tell us not only how 
rapidly to pace a unit on fractions, but also what the scope and content of 
that unit should be. Considered together with our knowledge of effective 
structural features of instruction (e.g., the importance of pacing, overlap, 
group size, etc.), additional knowledge of effective content features should 
provide us with a well-rounded guide to practice. The expertise of asking 
and answering this next wave of questions is just beginning to develop; it is 
intertwined with the work on expertise in complex knowledge domains 
discussed earlier. As promised earlier, emerging findings in this area will be 
highlighted later in the chapter. 

Cognitive Strategy Instruction 
Another line of research that has informed our knowledge regarding 
effective instruction for at-risk students is work in the area of cognitive 
strategies. Cognitive strategies refer to skills, such as rehearsing, elaborating, 
organizing, summarizing, and self-questioning, that facilitate the acquisition 
of knowledge. Cognitive strategy work holds significance for the instruction 
of low-achieving students because: (1) evidence suggests significant differences 
between good and poor learners in their use of cognitive strategies and 
(2) work in the area of cognitive strategy training indicates that strategies can 
be taught successfully to low-achieving students and that, once learned, use 
of the strategies leads to increases in student learning. Each of these is 
discussed below, followed by a discussion of nagging concerns in this area. 

Lack of Spontaneous Strategy Use by Poor Learners. Low-achieving 
students are often observed to exhibit deficits in the use of 
learning strategies. For example, early work in this area (Torgesen, 1977) 
revealed that poor students do not use the strategy of categorization as an 
aid when asked to remember a series of pictures or words. Similar 
observations regarding the failure of LD children to spontaneously use 
effective strategies have been made by Tarver, Hallahan, and Kauffman 
(1976) and by Dawson, Hallahan, Reeve, and Ball (1980) in their studies of 
selective attention. In addition, Campione and Brown (1977) have noted that 
mentally retarded children do not use strategic behavior spontaneously and 
have difficulty selecting, modifying, and sequencing strategies. Similarly, 
Borys (1979) has demonstrated that retarded children have problems with 
self-questioning. 

Much of the recent work in this area has been in the domain of reading 
comprehension. A number of reviews have been published of studies 
investigating good-reader/poor-reader differences in text processing (e.g., 
Golinkoff, 1976; Ryan, 1981). These reviews show consistent differences 
between good and poor readers in how they approach the task of reading. In 
general, studies point to the failure of poor readers (1) to conceptualize 
reading as a search for meaning rather than as a decoding task; (2) to monitor 
when they are experiencing problems in comprehension; (3) to engage in 
strategic behavior that will bring meaning to the text or restore meaning if 
there has been a breakdown in comprehension; (4) to skim, scan, reread, 
take notes, or make inferences; and (5) to modify how they read passages 
based on the difficulty of the material and/or their reasons for reading it. 

Evidence regarding different levels of strategy use between good and
poor learners in another domain (mathematics) comes from a series of
studies conducted by Peterson and her colleagues (Peterson, Swing,
Braverman, and Buss, 1982; Peterson and Swing, 1982; Peterson, Swing, Stark,
and Waas, 1984; Peterson, 1986). They explored students' reports of their
thoughts during both simulated and classroom instruction in mathematics. In
general, they found that low-achieving students do not spontaneously use
the sophisticated kinds of strategies that have been identified by researchers
as effective. High-achieving students, on the other hand, were more likely to
report engaging in the following cognitive processes during classroom
instruction in mathematics: (1) attending to the lesson, (2) understanding the
lesson, (3) either employing a variety of specific cognitive strategies or
engaging in these cognitive strategies more frequently, (4) engaging in
strategies that involved problem-solving steps or showed insights into the
mathematics, and (5) using the specific strategy of relating the new
information to be learned to prior knowledge (Peterson, 1986, p. 22).

In general, results from a diverse group of studies have suggested
inferior use of learning strategies by low-achieving students. Although these
findings are important, they are also mostly descriptive; that is, the objective
of these studies was to describe cognitive strategy use rather than to
generate information on how to teach cognitive strategies. Another line of
work, however, examines how children learn to use strategies and the role
that instruction can play in facilitating such learning. We turn now to a
discussion of these themes in cognitive strategies research.

Cognitive Strategy Instruction. Cognitive strategy instruction
consists of deliberate, planned training that aims to teach students learning
strategies that will enhance their knowledge and skills acquisition. Many
training programs also work toward the development of metacognitive skills,
that is, student awareness of when and how to use strategies. Currently,
there is no consensus on the best way to train low-achieving students to
acquire, use, and monitor cognitive strategies. Strides have been made,
however, toward making cognitive strategies training research more
informative to the actual classroom instruction of low-achieving students. Most
early studies in this area were conducted in the laboratory (e.g., Schumaker,
Deshler, Alley, Warner, and Denton, 1984; Wong and Jones, 1982).
Although providing valuable information, their application to classroom
practice was considered tenuous for several reasons. First, the majority of
laboratory training studies were of short duration and consisted of training
conducted by the investigator(s) rather than the classroom teacher. Second,
learning outcomes measures were often narrowly defined and usually related
directly to the experimental task (Peterson and Swing, 1983). Hence, even if
successful student learning was demonstrated, the implications for ongoing
academic work in the classroom remained unclear.

More recently, cognitive strategy training research has moved into the
classroom (Brown and Palinscar, 1982; Palinscar and Brown, 1984; Paris,
Cross, DeBritto, Jacobs, Oka, and Saarnio, 1984). These studies are
characterized by the training of classroom teachers followed by the delivery
of cognitive strategy instruction by the teachers to their own students over
extended periods of time. One of the most frequently cited approaches to
cognitive strategy training is reciprocal teaching (Palinscar and Brown,
1984). The objective of reciprocal teaching is student acquisition and
effective use of comprehension-monitoring strategies. We will describe this
approach briefly because it can be used to illustrate several important issues
in this area of study.

Reciprocal teaching in reading can be characterized as a dialogue
between teacher and students that takes place in a cooperative learning
group and is structured by four specific comprehension-monitoring
strategies. The four strategies are (1) summarizing (identifying main idea
information); (2) questioning (self-questioning on information identified as
important); (3) demanding clarity (noting when there has been a breakdown
in comprehension and restoring meaning); and (4) predicting (hypothesizing
what the author will discuss next). The procedure of reciprocal teaching
features guided practice in applying these strategies to the task of text
comprehension. The overall goal is the joint construction of the meaning of
the text.

Participants in the dialogue take turns assuming the role of teacher. In
the beginning of training, the teacher assumes primary responsibility for
initiating and maintaining the dialogue. The teacher both provides
instruction in and models the use of the four strategies. With each session, the
teacher transfers more responsibility for the dialogue to the students while
continuing to provide feedback and coaching. At the beginning of each
session, the group reviews the strategies, the reasons they are learning the
strategies, and the situations in which their utilization would be helpful.

Variants of this approach have been used in many classrooms, but
Palinscar and Brown were the first to formally investigate the effectiveness
of the reciprocal teaching approach. In a series of studies, Palinscar and
Brown observed classrooms in which reciprocal teaching was conducted
primarily by remedial reading teachers in their natural reading groups
(Brown and Palinscar, 1982; Palinscar and Brown, 1984). The middle-school
students participating in the studies were identified by their schools as
reading at least one year below grade level, with typical comprehension skills
at least two years below grade level. The group sizes ranged from four to
eighteen students, and there were usually twenty days of instruction
preceded by five-day baseline periods and both short- and long-term
maintenance checks.

Findings from the studies indicated that students did learn the intended
reading comprehension strategies and, over time, became increasingly able
to implement them independently. Outside of the group, Palinscar and
Brown documented large and reliable improvements in performance on daily
comprehension tests that were independent of the training materials.
Furthermore, improvements were noted on classroom measures of
comprehension taken in nonreading classes (e.g., social studies and science) and on
transfer tests, such as writing summaries, predicting test questions, and
detecting text anomalies.

On the surface, reciprocal teaching appears deceptively simple. Closer
examination of the procedure, however, reveals attention to several
dimensions acknowledged to be important throughout the cognitive-strategies
training literature. They are (1) generalization and transfer, (2) the degree to
which cognitive-strategies instruction is embedded in academic content, and
(3) the instructional approach used to teach cognitive strategies. Each of
these dimensions is discussed below. As will become evident, quite a lot has
been learned regarding how to teach cognitive strategies, but several
questions remain. The following discussion attempts to overview both
ongoing concerns and the current state of knowledge in this area.

1. Generalization and transfer. A problem that has continually plagued
cognitive-strategy instruction has been the tendency for students not to
apply the learning strategies outside of the experimental situation in which
they were learned. For example, mildly retarded and learning-disabled
students who were taught learning strategies for dealing with specific
memory problems often failed to demonstrate ongoing use of those
strategies unless "nudged" by persistent prompting by the trainer. In
addition, evidence of transfer to other similar memory tasks was sparse
(Borkowski and Cavanaugh, 1979; Brown, 1978; Campione and Brown,
1977; Torgesen, 1977). It should be noted that the earliest training studies
were generally not as concerned about this lack of generalization or transfer
as were later studies. The aim of these earlier studies was to demonstrate
that the strategies could be taught and that their use would benefit the
acquisition of knowledge and skills. After much evidence accumulated
regarding the "teachability" of cognitive strategies, however, researchers
turned their attention to how continued, appropriate use of the strategies
outside of the immediate experimental condition could be facilitated.

The generalization and transfer of cognitive strategies seems to be
facilitated by explicit training in the role of the strategy as well as training in
the strategy itself (Brown, Bransford, Ferrara, and Campione, 1983). Rather
than simply requiring students to execute the strategy, the instructor informs
the students about the use of the strategy and tells them why and in what
situations it will work for them. In addition, students are given practice in
controlling and overseeing the use of the strategies being taught. In
reciprocal teaching, for example, instruction continually works toward the
students' understanding of the goal of the strategy they are using. At the
beginning of each session, the group reviews not only the four strategies, but
also the reasons they are learning the strategies and the types of learning
situations in which they would be particularly useful. In addition, reciprocal
teaching aims to help students monitor the use of their strategies (i.e., to
check to see if the strategies are working and make appropriate corrections if
they are not).

Within a taxonomy of training approaches described by Campione,
Brown, and Ferrara (1982), reciprocal teaching is an example of self-control
training, the most comprehensive level of instruction. Campione and his
colleagues have categorized training studies according to the degree to
which students receive training in the usefulness of strategies and the
monitoring of their use. The first level is called blind training, and refers to
an instructional situation in which the student remains naive regarding the
role of the to-be-learned strategy. The investigator simply tells the learner
what strategies to perform with no information regarding why these
particular strategies have been selected or in what situations they might be
appropriate. In the second level, informed training, the student receives
information regarding the usefulness of the strategies (e.g., how the
strategies help performance, with what success, and in what situations).
This information is intended to increase the learner's awareness of the role
and significance of the strategies. Finally, in the third level, self-control
training, the student receives all of the information given in the first two
levels plus instruction in how to monitor and evaluate strategy use. The
student is taught both how to produce and regulate cognitive-strategy
activity through explicit instruction in planning, checking, and monitoring.

Evidence suggests that students with learning problems profit most
from interventions that can be described as self-control training (Miller,
1985; Kurtz and Borkowski, 1985). For example, Kurtz and Borkowski
randomly assigned 130 fourth, fifth, and sixth graders to one of three
treatment conditions. In the first condition, the students received
straightforward strategy instruction on how to summarize, identify main ideas, and
generate topic sentences. In the second condition, the students received this
instruction plus instruction regarding the value of monitoring reading
strategies, the importance of deliberate strategy selection, the flexible use of
strategies, and the need to work slowly. The third group was a control group
that simply read the experimental material and wrote summaries. Results
indicated that the second group, which received the metacognitive instruction,
performed the best. Thus, although students without diagnosed learning
problems can often benefit from blind or informed training, remedial
students instructed under these less explicit conditions frequently fail to
acquire and/or generalize cognitive strategies. In order for instruction in
cognitive strategies to have carryover effects for low-achieving students,
one prerequisite seems to be that the students are made aware of not only
the significance of the strategies, but also how their use of the strategies can
be regulated and, if necessary, readjusted.

2. Cognitive strategy instruction and academic work. The issue of the
degree to which cognitive strategy instruction should be embedded in
regular academic work is related to the issue of generalization and transfer.
Should the teaching of cognitive strategies be incorporated into students'
regular academic work in the various subjects in order to ensure the most
reliable use of the strategies in academic learning? Or can cognitive
strategies be taught separately from academic content and still result in their
transfer to academic work? These questions are often debated and, although
unequivocal answers are not available, current evidence suggests it is best to
teach cognitive strategies in the context of regular academic work.

Reciprocal teaching is an example of cognitive-strategies training that
is designed to be delivered within an academic work context. In this
procedure, strategies are always modeled in appropriate academic contexts,
not as isolated separate skill exercises. The four strategies are embedded in
the actual task of reading, and the students understand that the overall goal
of the activity is to construct the meaning of the passage being discussed.

Cognitive-strategies training programs based on the explicit teaching of
learning strategies independent of specific academic content have also been
advocated. Many of these training programs have been adopted by a number
of school districts in recent years. These programs are often referred to as
thinking-skills programs. Examples include the Instrumental Enrichment
Program (Feuerstein, 1980), the Philosophy for Children Program (Lipman,
Sharp, and Oscanyan, 1980), and Whimby and Lochhead's short course in
Problem Solving and Comprehension. Feuerstein's program consists of
thirteen types of exercises (e.g., identifying geometric figures embedded in
dots, completing number sequences) repeated in cycles. After students work
individually on each exercise, they participate in a classroom discussion that
provides them with the opportunity to learn from other students' strategies.
The teacher's task throughout is to encourage cognitive activities
appropriate to the materials. In the Philosophy for Children Program, students are
provided with stories about groups of children and adults who engage in
dialogue concerning issues that arise in the course of everyday life. Different
aspects of these stories illustrate various "thinking about thinking" skills
such as assessing factual information objectively, formulating cause-effect
relationships, and drawing generalizations. The situations in which the
Philosophy for Children material is posed are concrete as opposed to abstract
as in the Feuerstein program. Finally, the Problem Solving and
Comprehension course involves practice on a variety of well-defined problems in the
context of a thinking-aloud procedure that provides for feedback regarding
the processes of thinking as well as the products of thinking. The problems
are of four basic types: verbal reasoning, analogies, analysis of trends and
patterns, and mathematical word problems.

Data to support the effectiveness of these approaches are sparse.
Feuerstein has collected data (1979, 1980) that indicate that teachers enjoy
and approve of this kind of instruction and that students exposed to this
instruction have demonstrated improvement on standardized tests of
intelligence. Very little data, however, are available regarding its effects on
classroom learning, particularly in the area of generalization to academic
skill areas. In a review, Nickerson (1984) proposed that the developers,
convinced of the inherent worth of their programs, have not been inclined to
evaluate them. In addition, an analysis of these three programs by
Bransford, Arbitman-Smith, Stein, and Vye (1985) suggests that, although
all three programs provide some ingredients necessary for transfer, "there is
no strong evidence that students in any of the three thinking skills programs
improved in tasks that were dissimilar to those already explicitly practiced"
(p. 202).

The debate regarding whether or not cognitive-strategies training
should be embedded in academic content ties into a related, unresolved
issue in this area: the degree and type of interdependence between
knowledge-base components and strategy components in intellectual
functioning. Recent models of good strategy use have suggested that strategic
knowledge is only one type of knowledge possessed by competent strategy
users (Pressley, Borkowski, and Schneider, in press). These users also
possess a great deal of knowledge about the world in general and about
specific content areas. In other words, they also have an extensive
declarative knowledge base. Similarly, recent conceptual frameworks of intellectual
functioning view competent performance as multifaceted. Intelligent
performance is suggested to be comprised of several components, including
cognitive strategies, metacognitive knowledge and skills, and
knowledge-based components (Brown, Bransford, Ferrara, and Campione, 1983;
Flavell, 1985; Sternberg, 1985).

How strategy and knowledge-base components interact to provide for
competent performance poses a challenging question. It has been suggested
that an individual's knowledge base can have an enabling or an inhibiting
effect on strategy execution (i.e., an individual may have learned a variety of
strategies but the extent to which he or she is able to apply them will depend
on the extensiveness of his or her knowledge base). For example, Levin and
Pressley (1981) have suggested that an effective reading strategy is to access
prior knowledge about a topic before reading a passage about the topic. If an
individual has no prior knowledge about the topic, however, this strategy
(no matter how well learned and/or appropriately accessed) will be
ineffective. A similar example involves the use of categorization as an aid to
remembering a set of words or pictures. When an individual groups a series
of items into meaningful categories, the capacity of her or his short-term
memory is "stretched." Instead of needing to remember, for example,
sixteen separate items, the individual only needs to remember four
categories. These four categories, in turn, stimulate the recall of the individual
items. A scuba diver who takes on the task of memorizing the following set
of terms will find the categorization strategy much more effective than will a
land lover: snorkel, regulator, narcosis, compass, bends, fins, depth gauge,
mask, embolism. Because the individual terms are meaningful to the diver,
grouping them in a meaningful way is both possible and effective.

To date, most approaches to cognitive-strategy instruction do not deal
extensively with the interaction of knowledge and strategy use. Although
reciprocal teaching occurs within the context of a particular academic
domain (i.e., reading), the knowledge component of reading is difficult to
define. Reading as it is taught in elementary schools may be a process-heavy
kind of intellectual skill: note that once students master vocabulary, syntax,
and comprehension skills, their reading instruction per se ceases and they
proceed to the use of reading in the learning of other subject areas (e.g.,
science and social studies). The knowledge components of these subjects
consist of the substantive facts and concepts that make up the discipline; the
knowledge components of "generalized" reading, on the other hand, cannot
be so neatly (precisely) defined. Hence, even though reciprocal teaching is,
in general, viewed as integrated with academic content, one can argue that
the nature of the knowledge-base components of reading is qualitatively
different from the nature of the knowledge-base components in other subject
areas. Thus it may be necessary to develop more examples of instruction
that attend to both knowledge and strategy components in other domains
(e.g., math, science, history, etc.).* Insufficient attention to
strategy-by-knowledge-base interaction may have potentially the most worrisome effect
on low-achieving students. These students have been viewed as having an
impoverished knowledge base in comparison to their normal-achieving
peers. To the extent that a well-articulated knowledge base is a prerequisite
to enhanced strategy use, the deficits that low-achieving students possess in
this regard could be particularly devastating. Unless future work in this area
demonstrates how the knowledge base and strategy use can be developed
hand-in-hand, continued isolated inquiry into cognitive-strategies training
with low-achieving children may be nonproductive.

In order for future work to be inclusive of both strategy and knowledge
components, much more detailed information is needed regarding the nature
of competent performance in complex domains. A revised instructional
program could then be informed by this fine-grained analysis of both
strategic and knowledge components of competent performance. Once
again, the reader is reminded that work in this area has only recently begun.
Our discussion highlights, however, that the pay-off of investigations into
the nature of performance in complex tasks may extend to advances in
cognitive-strategies training programs as well.

3. Instructional approach. Instruction in cognitive strategies can take a
number of forms, ranging from discovery methods to direct instruction
approaches. We will briefly review three instructional approaches
(discovery, direct instruction, and proleptic instruction), with particular
attention to the appropriateness of each approach for low-achieving
students.

Discovery methods are often promulgated as the ideal approach for
promoting students' development of thinking skills. Discovery approaches
are characterized by an unstructured learning environment and
student-directed learning. Children are encouraged to engage in a number of
stimulating activities and given the opportunity to ask questions of both
peers and teachers. Advocates note that learning by discovery is motivating
and that it leads to retention and deep understanding, thereby increasing
chances of transfer. A related approach, guided discovery, suggests a more
structured environment which, its proponents argue, more often leads to
student "discovery" of appropriate strategies. In a guided discovery lesson,
for example, the teacher arranges for his or her students to experience a
conflicting situation, the resolution of which requires development of
sophisticated strategies. The challenge is how to engineer learning
encounters so that the discovery of appropriate strategies takes place. An example
of a guided discovery method is inquiry teaching (Collins and Stevens,
1982). It should be noted that critics of the discovery approach (e.g.,
Wittrock, 1966) question its efficiency as an instructional method, especially
with slower learners.

* This discussion is not meant to give the impression, however, that knowledge is not
important in reading. Bransford (1979) and others (Spilich, Vesonder, Chiesi, and
Voss, 1979) have done extensive work that demonstrates the important role of
background knowledge in reading comprehension. The issue is more related to ease
of characterization of the knowledge base and (perhaps) the degree to which less
knowledge can be offset by more strategy.

Direct instruction approaches fall at the other end of the spectrum;
they are characterized by highly structured, teacher-led training. Direct
instruction approaches vary in their degree of comprehensiveness. On the
one hand, direct instruction of cognitive strategies can include training in the
strategies themselves along with verbal encouragement that students use the
strategies. On the other hand, direct instruction can also include explicit
explanations regarding when and how to use the strategies along with
teacher demonstration and student practice with subsequent teacher
feedback. Duffy, Roehler, Vavrus, Book, Meloth, Putnam, and Wesselman's
(1984) work on explanations, referred to as direct explanations, is an
excellent example of comprehensive direct instruction in cognitive
strategies. (Evidence of the effectiveness of their direct explanation
approach with low-group, third grade students has been suggested by Roehler,
Duffy, Putnam, Wesselman, Sivan, Rackliffe, Book, Meloth, and Vavrus
[1986].)

The final method is termed dyadic, or proleptic, instruction and is
rooted in the learning theories of Vygotsky (1978). According to Vygotsky,
learning occurs through social mediation in heterogeneous, cooperative
group settings. Less expert group members observe the more advanced
learning strategies of more expert members; gradually they begin to
internalize the strategies and take on more and more responsibility for their own
learning. The explicit communication of effective strategies by the expert
members is often viewed as a support or "scaffold" for the initially halting
learning of the less sophisticated members. An important aspect of the
scaffold concept is that provisions are made for its gradual removal as
learners become less and less dependent on it for learning. The reciprocal
teaching procedure described earlier is an excellent example of scaffolded,
or proleptic, instruction. As noted earlier, after the four strategies are
explicitly taught and modeled in a group situation, individual children are
assisted in gradually internalizing the strategies and eventually leading the
group in their use.

Pressley, Snyder, and Cariglia-Bull (in press) have recently reviewed
these three approaches to cognitive-strategies instruction along with several
other approaches. In summarizing their information, they were very careful
to note that long-term experiments comparing different instructional
approaches are lacking and hence definitive answers are far off. Given the
current evidence, however, Pressley and associates concluded that
comprehensive direct instruction (e.g., Duffy et al., 1984) and reciprocal
teaching seem to be the most powerful approaches for student acquisition
and generalized use of cognitive strategies. They go on to offer that proleptic
instruction may be the best choice for use with low-achieving students: "It
seems to us that of all the alternative teaching techniques, proleptic
instruction is the one most likely to take any given nonstrategic child and
turn her/him into a competent strategy user" (p. 49).

Similarly, Brown and Palinscar (1982) conducted a study aimed at
analyzing the instructional components of reciprocal teaching. Training in
the four comprehension-monitoring strategies was conducted under four
different conditions. One was the traditional reciprocal teaching, a form of
proleptic instruction. The second condition, reciprocal teaching/practice,
included similar initial instruction, followed by much practice opportunity
but minimal feedback. The third condition, demonstration, permitted little
group interaction or practice. Finally, the fourth condition involved
introduction of the skills (one at a time) via worksheets with lots of student
practice with individual teacher assistance, thus providing the opportunity
for a lot of teacher-student interaction. Results of the study suggest that
traditional reciprocal teaching was most effective, followed by (but not
closely) the fourth approach (worksheets and practice).

What is noteworthy about these findings is that a more economical
means of cognitive-strategies training that is as reliable as proleptic
instruction has not been identified. With low-achieving students, this mediated
form of instruction, although most time-consuming, is also most effective.
Essentially, proleptic instruction is individually tailored instruction.
Although Palinscar reports teachers successfully using this technique in groups
as large as eighteen, the feasibility of widespread adoption in regular
classrooms as they exist today has been questioned. The problems would
essentially be the same set of problems encountered in implementing an
individualized program: limited time of the teacher to be a resource to each
individual, keeping other students actively and meaningfully engaged while
the teacher attends to an individual, and the difficulties involved in tracking
separate student learning paths.

In addition to adjustments based on the labor-intensiveness of this
form of instruction, adoption in classrooms of proleptic forms of instruction
would also entail a reconceptualization of the role of the teacher. Many
current teachers have adopted a role much more aligned with the direct
instructional approach; that is, the teacher's responsibility is to (1) decide
what is to be taught and (2) transmit that knowledge to students as
effectively and efficiently as possible. Thus the teacher is in a very
dominant, active role. By contrast, proleptic instruction places the teacher
more in the role of a coach. In addition to modeling correct performance, the
teacher must make decisions regarding the level of competence of each
student. Then the teacher must provide supportive prompts and feedback to
move the student to the next level of competence. The student, however,
does most of the work; the teacher's job is to provide sensitive diagnosis,
modeling, prompting, and feedback in order to move the student to higher
and more independent levels of learning. The complexities involved in
recasting teachers into the coaching role would require large-scale changes
in teacher education programs.

In summary, the present data suggest several considerations that
should underlie cognitive-strategies training with low-achieving students.
First, poor learners need explicit instruction in how and when to apply
cognitive strategies as well as how to monitor their use. When instruction
stops at the level of simple instruction of what the strategies are,
low-achieving students have difficulty applying them outside of the immediate
training situation. Second, cognitive-strategy training has the best chance of
being incorporated into the poor learner's repertoire if it is integrated with
regular classroom instruction in the various subject matter domains.
Although explicit teaching of cognitive strategies separate from academic
content may have many positive benefits (teachers like it, it improves IQ
scores), there is little evidence to suggest that it consistently helps students
in their academic work. Additional investigations into the nature of expertise
in various domains will be required in order to provide cognitive-strategies
instruction that is meaningfully integrated with the knowledge-base
components of various academic domains. Third, and last, current evidence
suggests that the proleptic instructional approach as illustrated by reciprocal
teaching seems to offer the surest bet that cognitive strategies will be
acquired, used, and monitored with regularity by low-achieving students.
More within-experiment comparisons of the different instructional methods
need to be undertaken, however, before firm conclusions can be reached in
this regard. In addition, parallel work in the area of teacher training would be
required before proleptic instruction could be implemented on a widescale
basis.

Promising Agendas for Continued Development of the
Instructional Knowledge Base 

The preceding information has summarized many firm findings upon which
we can begin to build merged instructional programming for low-achieving
students. In addition, nagging concerns regarding the present knowledge
base have been discussed. In particular, the following three general
concerns have been noted: (1) in direct instruction, a need to examine new
forms that it might take in order to teach adequately higher-order cognitive
skills; (2) in the process-product findings, the need to unpack generic
variables in order to get at more fine-grained guides for practice; and (3) in
cognitive strategies research, the need to address knowledge components of
effective strategy use and the implementation requirements of various
instructional approaches to strategy training.

A common feature shared by these three concerns is the need to
address the content of instruction. In particular, the concerns highlight
sparse attention to teaching and learning in semantically rich domains that
correspond to the basic academic disciplines of reading, mathematics, 
writing, science, history, and so on. We know very little about how to teach 
and how students learn the kind of material that is (or should be) the 
mainstay of the academic curriculum. 

Up to this point, the majority of instructional investigations have
focused on the structural and delivery aspects of effective instruction. Much
valuable information has been learned in these areas. For example,
monitoring, feedback, and the effective use of time are important elements of
effective teaching; similarly, small-group instruction arranged in hierarchical
levels of difficulty and geared to the level of the child is conducive to
increased student learning. Notably lacking, however, has been attention to
the actual material being taught, that is, how teachers organize and deliver
the content of academic lessons and how students come to understand that
material. This latter information, it seems, would be an indispensable
prerequisite (or at least a corequisite) to decisions regarding how to arrange
the structural and delivery elements of teaching. This is not to belittle the
importance of information on structural and delivery aspects of instruction.
Rather, it is meant to point to the potential power of an instructional system
that considers both what is taught and how it is taught.

These are not easy questions. At present, investigations into areas
such as these are just beginning to take shape. This new genre of research on
subject matter content is informed by several lines of work. For example,
curriculum specialists are beginning to raise new concerns about the nature
of subject matter competence, and cognitive psychologists have begun to
focus on the identification of knowledge and skills associated with the
acquisition of competence in specific content domains. The final section of
our chapter highlights selected aspects of this research in three content
areas: science, mathematics, and history.

Science. Research in the content area of science has been undertaken
by cognitive psychologists, developmental psychologists, and researchers
interested in the design of instruction. By and large, cognitive
psychologists have focused their attention on the question of how students
learn scientific information; instructional psychologists, on the other hand,
have begun to wrestle with the practical problem of how the findings and
insights derived from cognitive research on science learning can inform
classroom practice.

Investigations into the acquisition of scientific knowledge have generally
focused on two interrelated areas: the identification of the conceptions
that uninstructed students possess regarding various scientific phenomena,
and the characterization of "knowledge-building" skills of more versus less
able students. Several studies have shown that uninstructed students have
definite conceptions about various natural phenomena and that these
conceptions are highly resistant to change. For example, "naive" subjects
(ages eight to eleven) think that light from a candle goes further at night and
that friction only occurs between moving surfaces (Stead and Osborne,
1980). It has been argued that the development of effective techniques to
confront and modify such misconceptions must be a priority for future
research (Erikson, 1983; Resnick, 1983).

More recently, the thrust of the work in this area has been aimed at the
identification of how children learn the extensive information associated
with the various science topics included in most school science curricula. A
characteristic of this and all research in the content areas is the maintenance
of a clear distinction between the memorization of scientific facts and the
development of the kind of scientific understanding that allows students to
answer questions about complex topics that require analysis, application,
and integration. Emphasis is placed on the latter (i.e., the development of
deep understanding of complex topics). For example, in a line of work being
conducted by Chi and colleagues (e.g., Chi, Bassok, Lewis, Reimann, and
Glaser, under review), careful attention is being devoted to the explication
of the "knowledge-building" skills used by more versus less successful
students when learning new science topics. Preliminary findings suggest that
the knowledge-building skills used by successful students (1) assist them in
going beyond the information that is presented to them, (2) help them to
organize their knowledge in meaningful ways, and (3) give their knowledge
conceptual power. Efforts are now under way to characterize the nature of
the contents of the knowledge-building skills used by successful science
students. The ultimate goal is to be able to instruct less able students in the
use of these exemplary knowledge-building skills once they have been
identified.

In the meantime, classroom application work is also being initiated.
Instructional psychologists have begun training and implementation
research programs, the goals of which include training science teachers to (1)
analyze student understanding of science concepts, (2) analyze science
instructional materials, and (3) modify existing instructional materials and
procedures. For example, Gunstone, Champagne, and Klopfer (1981) have
designed and tested innovative instructional strategies in physics that
include qualitative analyses of problems and ideational confrontation.
Research-based training programs such as these represent a link between the
basic inquiries of cognitive psychologists and the real world of the
classroom, thereby testing ideas in practice and providing continued
suggestions from and for practicing teachers.

History. Appreciation for the work undertaken in the area of history
relies on the following awareness: most history or social studies instruction
delivered in our elementary and middle schools today is heavily influenced
by commercial textbooks. Both the content that is covered and the
instructional strategies used to present the content are deeply influenced (Shaver,
Davis, and Helburn, 1979). Therefore, detailed analysis of existing
textbooks is one method of increasing our understanding of the current state of
history instruction. Such an investigation of intermediate-grade history
textbooks by Beck, McKeown, and Gromoll (in preparation) has revealed a
serious lack of explanatory elements, including the significance,
interrelatedness, and causal connectedness of historical events or facts. In other words,
textbooks typically outline a series of events without stating the
relationships between and among those events. For example, in a unit on the
American Revolution, textbook authors might present a description of the
Boston Tea Party, followed by a discussion of the passage of the Intolerable
Acts, followed, in turn, by a discussion of the closing of the Port of Boston.
Left unstated, however, are the highly explanatory causal connections
between and among these three events, connections which, if understood,
contribute to a deep understanding of the causes of the American
Revolutionary War.

At a deeper level, the texts were also found lacking in their conveyance
of "big concept" ideas, i.e., ideas not tied to concrete events or persons, but
rather ideas that conceptually unite a movement or a historical period (e.g.,
the growing sense of U.S. identity of the thirteen colonies, the concept of
self-government, etc.). These findings on the superficial level of treatment of
historical topics by textbooks are just a first step, however. Needed are
further inquiries into the nature of how students learn complex historical
material and how teachers can assist them through the modification
and supplementation of existing textbook materials and procedures. In
addition, work is needed in the area of identifying key components of
teacher explanations that lead to increased depth of student
understanding.

Mathematics. Mathematics has been one of the more intensively
studied content areas. Math educators, developmental psychologists, and
(more recently) cognitive scientists have provided new insights into the
nature of mathematical knowledge and how it is acquired. By and large,
there is considerable optimism surrounding the potential of this research to
provide classroom teachers with a reasonable account of how various topics
within the math curriculum are learned by students and how they can best be
taught.

One branch of this research is focused on the nature of the intuitive
mathematical knowledge and competencies of young, uninstructed children
(Ginsburg, 1977; Resnick, 1987). A troubling question is why this informal
knowledge does not seem to support formal math learning in school. A
closely allied line of work looks at the actual and potential cognitive linkages
between "various ways of knowing mathematics." Although various
researchers use different labels for these different "ways of knowing math,"
most taxonomies include intuitive or naive knowledge, computational
knowledge (knowledge of procedural rules), concrete knowledge
(knowledge of how to manipulate objects to solve a problem), and principled
knowledge (knowledge of the conceptual underpinnings of mathematics).
Whereas some researchers have focused on how learners connect concrete
experiences, procedural skill, and conceptual understanding (e.g., Resnick
and Omanson, 1987), other investigators have chosen to focus on the
potential role of classroom instruction in improving competency in each of
these categories of knowledge or in strengthening the connections among
them. For example, Lampert (1986) has described a set of instructional
strategies for teaching multidigit multiplication to fourth graders which
seems to promote the integration of students' knowledge of rote procedures,
their intuitive understanding of how to operate on concrete quantities, and a
deeper conceptual knowledge of the mathematical principles that legitimate
the procedures. The lesson activities described (and taught) by Lampert
were designed to give children practice in realms where various relevant
math principles (e.g., place value, the distributive property of multiplication
over addition) were transparent in the steps of the procedures used to arrive
at an answer. For instance, Lampert used various grouping exercises with
U.S. coins which capitalized on the parallels between the relative values of
different kinds of coins and the way we use place value to compose
numbers.
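
How place value and the distributive property can be made visible in the steps of multidigit multiplication may be illustrated with a short sketch. The Python below is only an illustration under stated assumptions and is not a reconstruction of Lampert's lesson materials: the function names and the example factors are hypothetical. It splits each factor into its place-value parts and then sums the partial products.

```python
def place_value_parts(n):
    """Split a non-negative integer into place-value parts, e.g. 23 -> [20, 3]."""
    digits = str(n)
    parts = []
    for i, d in enumerate(digits):
        value = int(d) * 10 ** (len(digits) - i - 1)
        if value:  # skip zero digits, which contribute nothing
            parts.append(value)
    return parts or [0]


def partial_products(a, b):
    """List every partial product of a x b, following the distributive property."""
    return [(x, y, x * y)
            for x in place_value_parts(a)
            for y in place_value_parts(b)]


if __name__ == "__main__":
    a, b = 23, 14  # hypothetical example factors
    steps = partial_products(a, b)
    for x, y, product in steps:
        print(f"{x} x {y} = {product}")
    total = sum(product for _, _, product in steps)
    print(f"sum of partial products = {total}")
    assert total == a * b  # the partial products recompose the full product
```

Each printed line corresponds to one step that the standard written procedure performs implicitly, which is the sense in which the principle is "transparent" in the procedure.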

Similarly, Leinhardt has provided analyses of the teaching and learning
of subtraction in the second grade (1987) and the teaching and learning of the
equivalence of fractions in the fourth grade (1988a,b; Leinhardt and Smith,
1985). A primary focus of Leinhardt's work has been the detailed
examination of what she considers to be a core feature of a math instructional
episode: the explanation of new material and its logical connection to prior
material. The aim of this work is the identification of key explanatory
components that lead to increased student understanding. The research has
been carried out by analyzing a series of math lessons by an expert teacher
and by simultaneously monitoring the growth and structure of students'
knowledge bases as the lessons progressed. What emerges from this detailed
trace of instruction and student learning is a list of features to look for in an
expert explanation of one particular topic within elementary mathematics.

SUMMARY AND CONCLUSIONS 

In this chapter, we have attempted to place current instructional practice 
and research into the context of the changing educational responses to the 
needs of low-achieving students. Our review has traced these changing 
responses and has characterized them as passing from neglect, to segregated 
attention, to (attempted) integration. Seen through the lens of instruction, 
we have charted a progression from an orientation marked exclusively by care
and nurturance, to one of nonacademic skill training, to a present-day
approach dominated by the teaching and learning of academic skills and
facts through teacher-led, direct instruction. The advances have been great,
yet much remains to be accomplished. Integration is still often nominal and
much of the learning is still lower-order and often not easily transferred to
new areas. 

At the beginning of the chapter we stated that it is important to take
stock of what we know about the education of low-achieving students and
what we still need to know. From the perspective of instruction and
learning, this chapter outlines several things that we do know about effective
instruction for at-risk students; for example, teaching children directly the
material we want them to learn pays off, and mixed-ability small-group
learning promotes improved achievement. This chapter also points to gaps
in our knowledge; for example, how is a specific piece of complex material
taught and learned? How can we best foster mathematical problem solving?
Although our coverage of the terrain has not been comprehensive, we hope
that we have succeeded in highlighting what is known and what is not known
or only beginning to be known.

We have also made suggestions for the direction of future work in this 
area. We hope that our overview of work being carried out under the broad 
umbrella of research in the content areas illustrates the potential of this line
of work for informing the instruction of remedial students, as well as the 
complementary role that findings from such work can play to our existing 
knowledge regarding general effective teaching practices. Hopefully, the 
answers to questions asked by content researchers will provide a balance to 
the pedagogical answers that have been aggregating over the past twenty
years. 

In particular, we see research in the content areas as holding promise
for low-achieving students in two realms: (1) the identification of the
strategies used by more able students to learn subject matter and (2) the
identification of the key components of effective teachers' lessons. The
elucidation of methods by which successful students learn subject matter
may provide detailed guidance on how best to instruct less able students.
The identification of key components of exemplary lessons, on the other
hand, may begin to provide a linkage between the organization and
substance of teacher-presented material and the structure and content of
student knowledge. In other words, we hope to learn about the nature of
transmissions (i.e., how complex bodies of knowledge are transferred from
teacher to student).

We see both of these advances as particularly timely for movement to
the next stage of knowledge regarding effective instruction for low-achieving
students. This is for two reasons. First, the teaching and learning of subject
matter in remedial education has, by and large, been neglected. Although
emphasis has recently been placed on the provision of instruction in the
basics (i.e., reading and math), topics such as science and social studies
have, for the most part, remained untouched. Second, in the past we have
(at least implicitly) condoned the adoption of two different sets of goals for
low-achieving and regular students. A goal of general education is to
advance students to high levels of understanding of complex material. By
contrast, the education of at-risk students has often aimed lower, that is, to
the mastery of lower-level facts and routine procedures. This discrepancy in
goals undoubtedly has multifaceted roots. A contributing factor, however,
may be researchers' successful analyses of the learning requisites of facts
and procedures, thereby facilitating the prescription of methods of teaching
these skills effectively to low-achieving students. It is our hope that the new
investigations into learning complex material will lead to a clearer
understanding of the components of competent performance in higher-order
domains and subsequently to prescribed methods of teaching those
components to students who lack them.

This movement toward the study of the teaching and learning of
higher-level topics within particular content domains, we believe, represents
a potential breakthrough for the education of remedial students in particular.
In the past, we have been too easily satisfied by (apparent) gains in learning
which vanished once the props were removed or the unit had passed. As we
learn more about how complex topics can be best taught and understood, we
will expand our expectations to include the ability to analyze, apply, and
integrate material in conceptually powerful ways. Similarly, we will demand
instruction that helps students link new knowledge to what they already
know and assists them in going beyond the material presented, to reason
about the material in principled ways. This is a significant leap that places
new and unfamiliar demands on both teachers and learners. Nevertheless,
the decision to work toward these new levels of understanding represents
the only choice if we truly subscribe to equal educational opportunity for all
students. We have a new ally in our march toward equality: the beginnings
of a new set of effective instructional features for low-achieving students. It
is our belief that this new wave of content-relevant features holds the
potential to promote increased depths of understanding of all levels of
material by all students.

Chapter 1 of the Education Consolidation and Improvement Act (ECIA) is
the nation's largest federal education program, providing over $3 billion of
aid to local school districts for support of compensatory education
programs. Designed to increase educational opportunities and outcomes in
school districts serving high proportions of low-income students, the
Chapter 1 program signals the nation's commitment to improving education for
disadvantaged students. Unfortunately, after two decades of continuous
operation, evaluation research demonstrates that the program results in
discouragingly small achievement gains for participating students.

In 1983, Congress mandated that a National Assessment of Chapter 1
be conducted in order to consider the current operations of the program and
the prospects for its improvement. The Office of Educational Research and
Improvement (OERI) of the United States Department of Education
conducted this National Assessment and is releasing three reports to Congress
(Kennedy, Jung, and Orland, 1986; Kennedy, Birman, and Demaline, 1987;
OERI, in press). These reports, based on secondary analyses of existing
data as well as the results of original research, give a comprehensive
overview of program operations at multiple levels of the educational system.
They also provide educators and policymakers with an unprecedented
opportunity to reassess and improve upon current methods of delivering
compensatory education to disadvantaged students.

This chapter reports the results of a field study undertaken as part of 
the National Assessment. Among the original research studies sponsored by 
this initiative, the study discussed here was the only one that investigated 
the operations of the Chapter 1 program at the point of service delivery, in 
the schools and classrooms where participating students received the 
instructional services purchased by Chapter 1 funds. A major purpose of the 
research was to assess the "size, scope, and quality" of local Chapter 1 
instruction and to consider strategies for improving the program's capacity
to "meet the special educational needs of the children being served" (see
Section 556(b)(3) of Chapter 1, ECIA).

The study focused on three issues. First, it gathered descriptive data 
on the characteristics of Chapter 1 instruction and compared these to the
features of an effective instructional program as found in previous research. 
The purpose was to provide information on the quality of Chapter 1 
instruction in local schools. Second, the study investigated whether varia- 
tions in instructional quality could be accounted for by local design 
decisions, particularly decisions about the implementation of different 
models of service delivery (e.g., pullout, in-class, replacement, or add-on 
models). Finally, the study analyzed the extent and nature of coordination 
between Chapter 1 and regular instructional programs. The overall purpose 
of the study was to gather descriptive data on the design and operations of 
the program at the point of service delivery and to formulate strategies for
improving Chapter 1 instruction.



DATA AND SAMPLE 

The study was conducted in six states in diverse geographic regions of the 
country. Twenty-four schools were selected using a purposive sampling plan 
that took into account the following facts: (1) since approximately 90 percent 
of the school districts in the United States receive Chapter 1 funds 
(Kennedy, Jung, and Orland, 1986), the sample was chosen to include 
schools in large and small districts in urban, suburban, and rural
communities; (2) since districts are more likely to provide Chapter 1 services in
elementary than in secondary schools (Advanced Technology, 1983), the
sample included more elementary than secondary schools; and (3) since
Chapter 1 regulations permit schools to use a variety of service delivery
models (e.g., pullout, in-class, replacement, add-on), the sample was chosen
to include schools implementing the full range of service delivery models.

The final sample included seventeen elementary schools, three inter- 
mediate schools, and four high schools. Fifteen schools were located in 
urban areas, four in suburban areas, and five in rural areas. District 
enrollments varied from a high of over 430,000 students to a low of 
approximately 900 students. Most of the schools offered Chapter 1
instruction in both reading and math, although this was more true of the elementary
than the secondary schools in the sample. Moreover, most of the schools
used more than one service delivery model to deliver Chapter 1 instruction
(e.g., pullout was used in math, in-class for reading).

At each school in the sample, teams of three to four researchers spent a 
month conducting interviews with school staff and observing the instruction 
received by students. During the course of the study over 2,000 hours of 
academic instruction were observed, and interviews were conducted with 
over 400 teachers, administrators, and students. These procedures yielded 
descriptive data on patterns of Chapter 1 service delivery in schools (e.g.,
the delivery models in use, service schedules, and staffing patterns) and data
on the characteristics of the instruction received by Chapter 1 students.

The major task of the study was the observation of students' instruc- 
tion over the course of an entire school day. Only students who participated 
in the Chapter 1 program were observed. At each school, eight students
were observed for an entire school day, and two students were observed for 
an entire school week. During these observations, researchers used a coding 
scheme that recorded the amount of time students spent receiving instruc- 
tion in various subjects, the instructional formats that students engaged in 
during lessons, the sizes of the instructional groups in which students 
participated, and the instructors from whom students received instruction. 
Observers also kept a running narrative record that focused on the materials 
being used for instruction, the nature and skill levels of instructional tasks on
which students worked, the nature of verbal interactions between teachers 
and students, and the engagement and success rates of students during 
instruction. Thus both quantitative and qualitative data on students' instruc- 
tional experiences were collected over the course of a school day (or week). 

In addition to observations of Chapter 1 students, interviews were
conducted at each school with classroom teachers, school administrators,
and Chapter 1 staff (resource teachers, aides, and coordinators). The data
from these interviews were used to assemble descriptions of the service
delivery model(s) in use at a school, the scheduled frequency of Chapter 1
and other special instructional services, and a number of other topics not 
germane to this chapter but discussed in the final report of the research 
project (Rowan, Guthrie, Lee, and Guthrie, 1986). 

The resulting data set reflected the multilevel concerns of the research
project. Interview data provided information on the design characteristics of
Chapter 1 projects in the seventeen elementary and seven secondary schools
included in the study. And within these schools, student observations
provided information on the characteristics of instruction received by
Chapter 1 students. In the final analysis, the study obtained data on 241
Chapter 1 students (166 in grades 2 and 4; 75 in grades 8 and 10) and 
observed a total of 2,062 hours of academic instruction (1,357 hours in 
grades 2 and 4; 705 hours in grades 8 and 10). 
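
The reported totals can be cross-checked against their subgroup counts. The short Python sketch below only re-adds figures quoted in this section; the dictionary keys and variable names are illustrative labels rather than terms from the study.

```python
# Figures as reported in the text; the names are illustrative labels only.
schools_by_level = {"elementary": 17, "intermediate": 3, "high": 4}
schools_by_locale = {"urban": 15, "suburban": 4, "rural": 5}
students_by_grades = {"grades 2 and 4": 166, "grades 8 and 10": 75}
hours_by_grades = {"grades 2 and 4": 1357, "grades 8 and 10": 705}


def total(counts):
    """Sum the subgroup counts in a dictionary."""
    return sum(counts.values())


# Each entry pairs a recomputed total with the total stated in the text.
checks = {
    "schools (by level)": (total(schools_by_level), 24),
    "schools (by locale)": (total(schools_by_locale), 24),
    "students observed": (total(students_by_grades), 241),
    "hours observed": (total(hours_by_grades), 2062),
}

if __name__ == "__main__":
    for label, (computed, reported) in checks.items():
        status = "consistent" if computed == reported else "MISMATCH"
        print(f"{label}: computed {computed}, reported {reported} -> {status}")
```

All four subgroup breakdowns add up to the totals stated in the text.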



RESEARCH AND POLICY ISSUES 
ADDRESSED BY THE DATA 

The data were used to provide information on a number of questions about 
the design and implementation of Chapter 1 instructional programs in local 
settings. This section reviews these questions and shows how they are
related to a number of research and policy issues in the area of compen- 
satory education. 



Quality of Instruction in Chapter 1 Projects 

The National Assessment of Chapter 1 was designed to gather information
on the quality of instruction received by compensatory education students. 
In order to meet this goal, this study needed to develop an approach to 
gathering data that differed from much past evaluation research, particularly 
the large-scale outcomes studies that often have been used to evaluate the 
effectiveness of the Title I and Chapter 1 programs. These studies often take 
a macro approach to the evaluation of federal education programs. Re- 
searchers assume that all students who participate in a federal program 
receive comparable instructional treatments and thus that evaluations of 
program effectiveness can be accomplished by inspecting highly aggregated 
data on instructional outcomes. A number of observers have noted the 
shortcomings of this approach. A problem is that the instructional
"treatments" received by compensatory education students vary markedly from
school to school, contrary to the assumptions of the macro approach to
evaluation (Averch, Carroll, Donaldson, Kiesling, and Pincus, 1972; Wiley,
1979; Carter, 1984).

Mindful of this criticism, researchers have begun to advocate a micro 
approach to the evaluation of federal education programs. In this approach, 
the school-to-school variation in instructional programs is recognized and 
described in detail. This study follows this tradition of evaluation research. 
Data were collected on the characteristics of instruction provided to
Chapter 1 students at schools in the sample, and these data were compared to the
features of an effective instructional program identified by past educational
research. The major purpose of the analysis was to examine the processes 
within schools that lead to instructional outcomes and to describe the extent 
of variation in quality of instruction in a variety of local settings. 

The definition of quality instruction used in this study was derived 
from past research on teaching and instruction. On the basis of this
literature, the following variables were identified as components of an 
effective instructional program: 

Time. Educational research has shown a consistent relationship 
between the amount of time students spend on academic tasks and their 
subsequent performance on achievement tests (Walberg and Frederick, 
1983). The relationship of time to student achievement is greater in studies 
that measured engaged time rather than allocated time. Engaged time is that 
fraction of allocated time that students spend actively working on academic 
tasks (Fisher, Berliner, Filby, Marliave, Cahen, and Dishaw, 1980). The 
present study recorded the amount of time students in the sample spent in 
instruction in various subjects in both the regular and Chapter 1 programs.
In addition, qualitative data on student engagement and success were
gathered. These data were used to assess the extent to which Chapter 1
instruction contributed to students' academic learning time.
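
The distinction between allocated and engaged time can be made concrete with a small calculation. The Python sketch below uses invented numbers purely for illustration; they are not data from this study, and the function name `engagement_rate` is an assumption introduced here.

```python
def engagement_rate(engaged_minutes, allocated_minutes):
    """Fraction of allocated time spent actively working on academic tasks."""
    if allocated_minutes <= 0:
        raise ValueError("allocated_minutes must be positive")
    if not 0 <= engaged_minutes <= allocated_minutes:
        raise ValueError("engaged time must lie between 0 and allocated time")
    return engaged_minutes / allocated_minutes


if __name__ == "__main__":
    # Hypothetical lesson: 50 minutes allocated to reading, 35 minutes on task.
    rate = engagement_rate(35, 50)
    print(f"engaged fraction = {rate:.2f}")
```

Two lessons with the same allocated time can therefore differ substantially in engaged time, which is why the two measures relate differently to achievement.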

Class Size. Past research also indicates that student achievement is 
increased when learning activities take place in smaller groups (Cahen, 
Filby, McCutcheon, and Kyle, 1983). For example, a meta-analysis of
studies of class size by Glass, Cahen, Smith, and Filby (1982) presented a 
curve that traced the effects on learning of reductions in group size. This 
curve suggested that reductions in class size had minimal effects until 
instructional groups reached a size of about ten students. Below this 
number, reductions in class size tended to have larger effects. This same 
meta-analysis also suggested that reductions in class size had larger effects 
when the reduction occurred for longer periods of time. For example, Glass 
and associates (1982) arbitrarily divided studies into those that reduced 
group size for more or less than 100 hours and found that reductions lasting 
longer than 100 hours had larger effects than those lasting less than 100 
hours. The present study recorded the sizes of the instructional groups in 
which students in the sample participated, both in the Chapter I program 
and in the regular program. These data were then used to examine whether 
changes in student grouping arrangements could be expected to contribute 
to increased achievement of Chapter I students. 

Instructional Formats. A third component of instructional quality 
consists of the formats used by teachers during lessons. A great deal of 
research has searched for instructional formats that result in effective 
instruction for low-income, low-achieving students (for a review, see 
Brophy and Good, 1986). In the 1970s, researchers held out high hopes for 
individualized instructional formats, but the Instructional Dimensions Study 
(Cooley and Leinhardt, 1980), sponsored by the National Institute of 
Education (NIE) during its last evaluation of compensatory education (NIE, 
1976), provided little support for the effectiveness of this approach, at least 
as measured in the study. Alternatively, much more empirical support has 
been found for an approach that has come to be known as "direct 
instruction" (Brophy and Evertson, 1974; Good, 1978; Stallings and 
Kaskowitz, 1974; for a review, see Rosenshine, 1983). In this approach, teachers 
actively present lessons and provide students with guided practice in new 
academic skills. This approach contrasts sharply with the frequent use of 
independent seatwork as an instructional format, a feature common to many 
individualized programs. Although good instruction always includes some 
independent practice, and this kind of practice usually occurs during 
seatwork, recent research suggests that an overreliance on seatwork, 
especially its use to present new skills, is less effective than more "direct" 
instructional formats (Anderson, Brubaker, Alleman-Brooks, and Duffy, 
1985; Brophy and Good, 1986). On the basis of these findings, the present 
study recorded the amount of time students spent in independent seatwork 
as opposed to more "direct" instructional formats such as lecture/recitation 
activities. 



Curriculum Content. Discussions of instructional quality must consider 
not only how students are taught, but also what they are taught (Carter, 
1984; Cooley and Leinhardt, 1980). Increasingly, thoughtful observers are 
beginning to question the curriculum content of compensatory education 
programs (Botel, 1978; Allington, Steutzel, Shake, and Lamarche, 1986). 
Past research suggests that compensatory students spend much time working 
on "lower-order" academic skills. For example, students practice 
phonics skills but do little reading of connected text, or students practice 
basic arithmetic skills but do not apply these skills in problem-solving 
situations. Recent research also suggests that the "direct" instruction 
formats that many educational researchers advocate for use with low-income, 
low-achieving students may be of limited utility for instruction in 
higher-order thinking skills (Peterson, 1986). As a result of this debate, this 
study recorded the skills to which students in the sample were exposed, both 
in the Chapter I program and the regular program, and assessed whether 
Chapter I instruction focused on low-level basic skills, such as phonics 
drills and arithmetic facts, or gave Chapter I students an opportunity to 
engage in higher-order skills, such as the reading of connected text and the 
completion of problem-solving exercises. 

The Design and Implementation of Instructional Services 

The study was not simply interested in charting the characteristics of 
Chapter I and regular instruction in a diverse sample of schools. An 
additional purpose was to assess the extent to which instructional quality 
varied as a result of the Chapter I service delivery model used by schools. In 
particular, the study sought to describe the types of project designs used in 
schools and to assess the effects of these designs on the scope and quality of 
instruction received by Chapter 1 students. 

Questions about local design practices often focus on a specific issue: 
the relative merits of implementing pullout versus alternative models of 
service delivery. Early research on this question found that almost all Title I 
projects used pullout models (Glass and Smith, 1977), but later research 
discovered a slight trend away from this tendency, with school districts 
increasingly replacing pullouts with alternative designs involving in-class 
and replacement designs (Advanced Technology, 1983). This study was 
particularly interested in whether the implementation of these different 
service delivery models had consequences for the scope and quality of 
instruction received by Chapter 1 students, since it was felt that evidence on 
this point could help policymakers and practitioners better understand the 
instructional consequences of local choices about service delivery models. 

Much early research suggested that the use of pullout models was 
detrimental to instruction. Glass and Smith (1977, p. 5), for example, argued 
that "research does not support the wisdom of instruction under conditions 
like those that prevail in pullout programs." Kimbrough and Hill (1981) 
expanded on this critique when they argued that pullouts disrupted ongoing 
lessons in regular classrooms and caused students to miss some portion of 
their regular instruction. Other research suggested that the implementation 
of pullout designs can result in a lack of coordination between compensatory 
and regular instructional programs and that this can adversely affect student 
success in regular classroom lessons (Johnston, Allington, and Afflerbach, 
1985). 

Other researchers have developed a more balanced analysis of service 
delivery models. Archambault (1986), for example, reviewed a number of 
studies of the effects of pullout models on instruction and found that study 
results were inconsistent. He concluded that choice of a particular delivery 
model was less important to the quality and effectiveness of Chapter I 
instruction than a number of other factors, including curricular, staffing, 
grouping, and teaching practices. Archambault's work suggests a broader 
view of project design that includes a number of factors in addition to the 
particular service delivery model being implemented locally. 

Past evaluations of compensatory education suggest further considera- 
tions about local design practices. Carter (1984) noted that project designs in 
compensatory education are characterized by few uniformities. In part, this 
variability in design is due to the weak constraints placed on schools in 
federal education laws and policies. As Gaffney (1986) discussed, Chapter I 
legislation and federal education statutes give local school systems wide 
latitude in the design of local Chapter I projects. In addition to allowing 
schools to implement a number of different service delivery models (e.g., 
pullout, in-class, replacement, add-on), federal statutes prohibit the federal 
government from exercising any direct supervision or control over the 
curriculum, program of instruction, administration, or personnel of any 
school system (Gaffney, 1986). Given these circumstances, it would not be 
surprising to find that schools operating the same nominal service delivery 
model have few other instructional design features in common. 

Given the lack of restrictive guidelines about local project design, there 
is a need to investigate the extent to which schools uniformly implement 
various service delivery models and to examine service delivery models in 
the context of other instructional design features. Accordingly, the sample 
for this study included schools that used a variety of service delivery 
models, but these models were nested within sites that also contained a 
variety of other design features, including a variety of curriculum, staffing, 
scheduling, and management practices. This sampling strategy allowed an 
investigation of the extent to which projects using the same nominal service 
delivery model (e.g., pullout) were similar in other design features. It also 
allowed an analysis of the extent to which overall project design features 
affected the scope and quality of instruction received by Chapter 1 students. 
The purpose was to provide practitioners and policymakers with a better 
understanding of the instructional implications of design choices. 

Chapter 1 and the Regular Instructional Program 

A final set of questions concerned the relationship between Chapter 1 and 
regular instruction. Obviously, student achievement ultimately results from 
the combination of instruction in both of these programs, and most educa- 
tors view compensatory education as a supplement to a student's regular 
program of instruction. Thus an important topic of concern in this study was 
how Chapter I instruction fit within a student's overall instructional 
program. 

The last major evaluation of compensatory education (NIE, 1976) 
contained a number of findings relevant to this issue. A basic picture that 
emerged from this evaluation was that Title I instruction often substituted 
for, rather than added to, students' regular instructional programs. For 
example, the NIE (1976) study found that the average compensatory 
education student spent between 4 and 5½ hours a week in compensatory 
instruction, almost always after having been "pulled out" of the regular 
classroom, during which time about 40 percent of participating students 
missed instruction in a variety of regular classroom subjects. 

Policy analysts have discussed these findings in conjunction with 
criticisms of current program practices. For example, Brown (1982) and 
Walberg (1984) argued that compensatory education programs are not truly 
compensatory because they rarely add more instructional time to a student's 
instructional day. Almost all projects offer instruction during the regular 
school calendar and participating students often miss some portion of 
regular classroom instruction. A closely related criticism was offered by 
Kimbrough and Hill (1981), who argued that the widespread use of pullout 
models disrupted the instruction not only of pulled-out students, but also of 
students who remained in the classroom. 

It is interesting to contrast these criticisms, offered by policy research- 
ers and academics, with the views of teachers. In 1978, the NIE sponsored a 
conference in which teachers were invited to discuss research on compen- 
satory education. By and large, teachers were not much concerned with the 
fact that compensatory instruction caused some students to miss a portion of 
the regular curriculum. Some argued that mastery of basic reading and 
mathematics skills was paramount. Nevertheless, teachers at the conference 
did recognize that coordination problems existed, especially when pullout 
models were in use, and many teachers reported that they "saved" time for 
instruction in important subjects until it could be offered when all students 
were present (see Advanced Technology, 1983, pp. 5-30 for survey findings 
that confirm teachers' views). 

It is important to consider not only what students miss when they 
receive Chapter I lessons, but also the extent to which lesson content in 
Chapter I and regular classrooms is congruent. For example, Johnston, 
Allington, and Afflerbach (1985) concluded that most compensatory educa- 
tion students received compensatory reading lessons that bore little relation- 
ship to the reading lessons in their regular classrooms. They also reported 
that school personnel made little effort to systematically coordinate lesson 
content across instructional programs. On the basis of this and other 
evidence, they concluded that students would be more successful in the 
regular instructional program if they received compensatory instruction that 
was more congruent with what was offered in the regular program. 

To investigate these issues, this study observed students over the 
course of an entire school day, a procedure that allowed us to chart the 
congruence of lessons across different programs. In addition, we used 
observations and interview data to address other relevant questions. For 
example, we were interested in understanding more about the problem of 
missed instruction. As the NIE (1976) study showed, only 40 percent of the 
teachers surveyed reported that Chapter I students missed some portion of 
their regular instruction. In this study, it was possible to analyze the extent 
to which school-level design decisions affected the problem of missed 
lessons. Moreover, the interview data allowed an investigation of how 
teachers managed instruction so that the disruption of Chapter 1 instruction 
was minimized. And finally, the data were used to investigate how regular 
classroom teachers, Chapter I staff, and school administrators coordinated 
instructional programs and whether various coordinative procedures re- 
sulted in lessons that were more or less congruent across programs. 



FINDINGS 

A massive amount of descriptive data was collected over the course of the 
study, including qualitative and quantitative data on schools, classrooms, 
and students. The approach to the analysis of data followed procedures 
developed in other multisite case studies (Miles and Huberman, 1984). 
Analysis began on a case-by-case basis using a variety of data reduction 
forms, resulting in the writing of extensive case studies of twelve schools 
(Lee, Rowan, Allington, Anderson, Bossert, Harnischfeger, and Stallings, 
1986). Analysis then proceeded to cross-site comparisons and the develop- 
ment of important generalizations. The findings from this analysis were 
reported in the final report of the project (Rowan, Guthrie, Lee, and 
Guthrie, 1986). 

This chapter summarizes the major findings from the cross-site analy- 
sis in three general areas: (1) the quality of instruction received by Chapter I 
students in the sample; (2) the degree to which quality of instruction varied 
as a function of the design characteristics of schools, particularly the service 
delivery models in use; and (3) the relationship between Chapter I instruc- 
tion and regular instructional programs. 

Quality of Instruction 

Basic descriptive data on the quality of Chapter I instruction at the 
twenty-four schools in the study are presented in Tables 7-1 through 7-3. 
The data have been aggregated to the school level, and schools have been 
grouped by service delivery model (for elementary school reading projects, 
elementary school math projects, and secondary school reading and math 
projects). The first column in each table lists the number of days Chapter 1 
instruction in a particular subject was observed at a school. The second 
column lists the number of times per week students were scheduled to 
receive Chapter I instruction. In most schools this schedule was fixed, but in 
some schools with in-class designs, scheduling was variable and it was 
necessary to infer a schedule from the observed data. The next two columns 
combine the data on schedules with the data on service minutes to yield 
estimates of the weekly and yearly time the average Chapter I student at a 
school spent in Chapter 1 services. 
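
The time estimates in these tables rest on simple arithmetic, which can be sketched in a few lines of Python. The sketch is illustrative only: the function and variable names are assumptions of this example, not part of the study. Weekly minutes are taken as sessions per week times average minutes per service day, and yearly hours follow the tables' footnote (weekly minutes times 36 weeks, divided by 60). The Westwood and Parker figures are the ones listed for elementary reading.

```python
WEEKS_PER_YEAR = 36  # academic year length given in the table footnotes


def weekly_minutes(sessions_per_week: float, minutes_per_session: float) -> float:
    """Estimated minutes of Chapter I service per week."""
    return sessions_per_week * minutes_per_session


def yearly_hours(weekly_mins: float, weeks: int = WEEKS_PER_YEAR) -> float:
    """Yearly time in hours: (weekly minutes x weeks) / 60."""
    return weekly_mins * weeks / 60


# Westwood reading: 157 weekly minutes over the full 36-week year.
westwood = yearly_hours(157)

# Parker reading: a split schedule, 26 weeks at 62 minutes per week
# followed by 10 weeks at 155 minutes per week.
parker = yearly_hours(62, weeks=26) + yearly_hours(155, weeks=10)

print(round(westwood, 1), round(parker, 1))
```

Both results fall within an hour of the yearly figures listed for these two schools (94 and 52 hours), which suggests the tables report whole hours.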

In addition to data on time allocations, the tables include data on three 
other variables. One is the percentage of observed Chapter I instruction 




TABLE 7-L Characteristics of Chapter 1 Instruction: Elementary School Reading Projects 



Estimated ~ , * 

„ , , . Vc Instructor* 

Service 0 Service toer. Weeklv Yearly* _ . ^. Aver. ~ , „ 

Delivery Days minsl Tune Time T >P" nCI Group Size * in tormatst 



Model School Observed Schedule Servue day Slim. Hr\. CT RT A in CI led Ret Siuk Mffmi Surr 



Mixed 



Pullout 



ln*iltt%\ 



RepUu ement 



Parker 


9 


2x*5x/wk« 


30.7 


26wk/62 


52 


0 


1 


99 


<9 


42 


38 


7 


13 










IOuk/155 




















WcstVVOOd 


14 


5x/wk* 


31 4 


157 


94 


13 


10 


70 


4 5 


S2 


19 


7 


17 


Hayes 


17 


5x/wk' 


42.1 


210 


126 


3 


71 


27 


1 9 


67 


24 




1 


Kensington 


18 


5x'wk (9wks> 


133.1 


665 


100 


1 


58 


34 


5 7 


40 


W 


7 


0 


St. Mary's 


II 


3x/wk 


29.3 


87 


52 


0 


85 


15 


S S 


82 


0 


6 


12 


Danville 


14 


4x/wk 


50.1 


200 


120 


7 


73 


20 


66 




18 


6 


10 


Central 


15 


5\/wk 


32.2 


160 


96 


1 


39 


60 


* 2 


79 


10 


6 


5 


Hillside 


i5 


4x/wk 


46.5 


188 


113 


10 


64 


26 


4 1 


H 


8 


IS 


41 


Johnson 


15 


4xMk 


30.9 


124 


74 


1 


75 


21 


6 S 


78 


14 


6 


0 


Winkler 


14 


5x/wk 


2K.4 


140 


84 


0 


67 


Jl 


1 1 


S9 


19 


2 


0 


Tudor 


4 


Sx/wk 


25 6 


130 


78 


3 


97 


0 


3 8 


12 


44 


10 


34 


Huxley 


4 


2.5/wM 


25 S 


65 


39 


13 


0 


87 


3 6 


6S 


10 


24 


0 


Nelson 


17 


5x'wk 


28 7 


145 


87 


0 


6 


94 


10 


44 


41 


6 


0 


Sumner 


9 


4*>vvk$ 


21.3 


84 


50 


0 


0 


UK) 


1 7 


94 


0 


0 


0 


Evergreen 


10 


5x/nk 


21,5 


110 


66 


10 


0 


70 


68 


S4 


IS 


0 


0 


Lowell 


II 


5x/wk 


41.2 


20S 


123 


0 


J 


97 


4.3 


% 


14 


7 


3 


Washington 


IK 


5x/wk 


11 1.4 


555 


333 


90 


0 


10 


9.1 


so 


35 


12 


2 



Legend CT= Classroom Teacher, RT* Resource Teacher, A Aide, Lcc. Rcc Lecture, Recitation, Stwk * Scat work, Mgmt* Management, 
Su it » Surrogate 

* Yearly timc»(Wcckly time x 36 weeks) - 60 minutes 

* Percentage may not equal 100 because Testing and Other formats arc not included 

1 Percentage may not equal 100 because Classroom Teacher & Aide and Othc Instructor categories arc not included 

1 Time variable— Schedule estimated from observational data 

1 Time variable— -Estimatc«26 weeks at 2x/wk and 10 weeks at 5x/wk 



TABLE 7-2. Characteristics of Chapter I Instruction: Elementary School Math Projects

             # Service                 Aver.    Estimated Estimated  % Instructor    Aver.       % in Formats†
             Days                      mins/    Weekly    Yearly*    Type in CI‡     Group Size
School       Observed  Schedule        Service  Time      Time       CT   RT   A     in CI       Lec/Rec  Stwk  Mgmt  Surr
                                       day      (Mins.)   (Hrs.)

Mixed
Huxley       7         2x/wk§          [?]      56        34         26   28   46    6.5         53       4     2     22
[?]          [?]       [?]x/wk         [?]      36        22         0    45   55    13.5        16       25    11    30
Westwood     8         3x/wk§          14.1     42        25         0    0    100   7.9         31       19    9     42
Nelson       2         [?]             [?]      140       84         0    100  0     3.0         16       64    11    0

Pullout
Danville     [?]       [?]             60.[?]   244       146        22   78   0     4.2         36       37    1     7
[?]          [?]       2.5x/wk         [?]      58        35         3    5    92    5.1         76       0     24    0
Hillside     7         4x/wk           43.3     172       103        0    100  0     1.8         2[?]     13    16    41
Johnson      9         4x/wk           40.2     160       96         22   30   48    5.0         54       31    7     0
Winkler      [?]       5x/wk           24.8     125       75         3    36   58    1.6         62       12    5     17
Tudor        [?]       5x/wk           38.4     192       115        0    100  0     1.9         7        92    1     0

In-class
Sumner       11        4x/wk           30.3     120       72         27   0    73    7.2         41       53    3     0
Lowell       6         5x/wk           23.2     115       69         5    0    95    1.7         44       18    1     17

Replacement
Washington   17        5x/wk           25.1     125       75         96   0    4     9.4         26       59    11    0
Evergreen    14        5x/wk           55.5     280       168        77   0    21    [?]         41       28    13    18

Legend: CT = Classroom Teacher; RT = Resource Teacher; A = Aide; Lec/Rec = Lecture/Recitation; Stwk = Seatwork; Mgmt = Management; 
Surr = Surrogate; [?] = entry illegible in the source copy

* Yearly time = (Weekly time x 36 weeks) ÷ 60 minutes 

† Percentage may not equal 100 because Testing and Other formats are not included 

‡ Percentage may not equal 100 because Classroom Teacher & Aide and Other Instructor categories are not included 

§ Time variable: Schedule estimated from observational data 



TABLE 7-3. Characteristics of Chapter I Instruction: Secondary School Reading and Math

Service                 # Service             Aver.    Estimated Estimated  % Instructor    Aver.       % in Formats†
Delivery                Days                  mins/    Weekly    Yearly*    Type in CI‡     Group Size
Model        School     Observed  Schedule   Service  Time      Time       CT   RT   A     in CI       Lec/Rec  Stwk  Mgmt  Surr
                                             day      (Mins.)   (Hrs.)

Grade 8 Reading
Pullout      Einstein   4         2x/wk      35.8     72        43         6    94   [?]   [?]         8        20    6     61
Pullout      Kehoe      12        2.5x/wk    40.8     103       62         0    82   [?]   2.9         17       65    8     7
Replacement  Lakeview   13        [?]        [?]      325       195        95   0    [?]   15.7        1        50    42    0
Replacement  Taylor     6         5x/wk      56.2     280       168        2    81   [?]   [?].1       17       55    4     21

Grade 10 Reading
Mixed        Salvador   5         1.5x/wk§   12.0     18        11         0    0    [?]   [?]         [?]      9     0     57
Replacement  Coolidge   16        5x/wk      29.3     145       87         2    [?]  [?]   4.1         9        47    10    30
Replacement  Stevenson  16        5x/wk      39.8     200       120        7    [?]  [?]   8.0         65       5     23    6

Grade 8 Math
Pullout      Einstein   3         2x/wk      29.7     59        35         37   61   [?]   4.1         [?]      [?]   [?]   17
Replacement  Lakeview   14        5x/wk      [?]      [?]       111        93   0    [?]   11.8        [?]      [?]   41    0
Replacement  Taylor     [?]       5x/wk      57.1     285       171        0    91   [?]   12.6        [?]      42    7     0

Grade 10 Math
Mixed        Salvador   5         2x/wk§     14.0     28        17         0    0    [?]   [?]         [?]      0     0     19

Legend: CT = Classroom Teacher; RT = Resource Teacher; A = Aide; Lec/Rec = Lecture/Recitation; Stwk = Seatwork; Mgmt = Management; 
Surr = Surrogate; [?] = entry illegible or not alignable in the source copy

* Yearly time = (Weekly time x 36 weeks) ÷ 60 minutes 

† Percentage may not equal 100 because Testing and Other formats are not included 

‡ Percentage may not equal 100 because Classroom Teacher & Aide and Other Instructor categories are not included 

§ Time variable: Schedule estimated from observational data 






conducted by classroom teachers, resource teachers, or instructional aides. 
The next two variables are measures of instructional formats. The first is the 
average size of the instructional groups in which Chapter I students 
participated during Chapter I instruction; the second is the percentage of 
Chapter I instructional time that students spent in different instructional 
formats (these percentages may not total 100 since testing and other formats 
are not included in the table). 

The data confirm that the characteristics of Chapter I instruction 
varied greatly across schools, even those using the same nominal service 
delivery model. Nevertheless, we begin by describing the modal patterns in 
the data. At the elementary level, in both reading and math, the modal 
pattern was to offer Chapter I instruction four to five times a week for thirty 
to forty minutes per session. Across all schools, Chapter I lessons in reading 
in grades 2 and 4 averaged between 50 and 60 percent lecture/recitation, 
this study's measure of "direct" instruction. Chapter I math 
lessons at this level provided somewhat less "direct" instruction, averaging 
between 35 and 45 percent lecture/recitation across all schools. Finally, the 
one uniform characteristic of both Chapter I reading and math lessons at 
the elementary level was the small size of instructional groups. In both 
reading and math, Chapter I group sizes averaged from five to seven 
students. 

Fewer students and schools were observed at the secondary level, and 
variation across Chapter I projects was great, thus making generalizations 
more tenuous. The modal pattern was to offer Chapter I instruction in both 
reading and math five times per week, usually during a scheduled period of 
the school day. The length of this period varied across schools, depending 
on school schedules, but a period usually was between forty and fifty 
minutes in length. With respect to the amount of "direct" instruction 
observed, a bimodal tendency emerged. In schools that used self-paced, 
individualized curricula, secondary school students often spent a vast 
majority of their Chapter I time in independent seatwork. In these schools, 
"direct" instructional formats, such as lecture/recitation, were observed 
only about 10 percent of the time. On the other hand, some secondary school 
projects used a more conventional teaching technique that consisted of 30 
to 40 percent lecture/recitation, this study's measure of "direct" 
instruction. Finally, in some secondary schools, Chapter I group sizes in reading or 
math were above ten students, but across all secondary schools in the 
sample, Chapter I group sizes averaged between six and seven students. 

An interesting question is whether the instruction received by students 
in Chapter I settings differed substantially from that received in regular 
classrooms. Table 7-4 presents data comparing lesson formats and group 
sizes across the two programs for students in the sample. At the elementary 
level, the mix of instructional formats across the two program settings is 
roughly comparable, although Chapter I lessons appear to consist of about 
10 percent more "direct" instruction and somewhat more surrogate (e.g., 
computer-assisted) instruction than regular classrooms. As discussed above, 
at the secondary level, the Chapter I projects at schools in the sample 
provided a bimodal distribution of instructional formats. In the schools with 
self-paced, individualized curricula in reading (Lakeview and Taylor), 
students were provided with much less "direct" instruction in Chapter I 
lessons than in regular lessons. At the tenth grade, only one school in the 
sample offered Chapter I instruction in math, and in this school, 78 percent 
of Chapter I instructional time was given over to testing. Apart from these 
schools, however, group-paced instructional formats were used to provide 
Chapter I instruction, and in these schools there was little difference in the 
amount of "direct" instruction across programs. 

TABLE 7-4. Percentage of Time Spent in Different Instructional Formats by 
Subject and Grade Level 

Table 7-4 does show a major difference between Chapter I and regular 
lessons, however. At both the elementary and secondary levels, and in both 
reading and math, Chapter I group sizes were much smaller than those in the 
regular program. Indeed, based on prior studies of class size, it seems safe to 
conclude that the average size of instructional groups in Chapter I settings 
represented an important advantage of Chapter I over regular classroom 
instruction. For example, across all schools in the sample (i.e., both 
elementary and secondary schools) instructional groups in regular reading/ 
language arts had an average size of 17.6 students and Chapter I groups had 
an average size of 6.6 students. In math, regular classroom groups averaged 
20.2 students and Chapter I groups averaged 7.7 students. In fact, an 
examination of Tables 7-1 through 7-3 demonstrates that small group size 
was the most uniform characteristic of Chapter I instruction. 
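
The size of this advantage can be made concrete with a short calculation. The Python sketch below is illustrative only: the names are assumptions of this example, and the cut point of ten students is the approximate figure attributed earlier in the chapter to Glass and associates (1982). It computes the relative reduction in group size for each subject and checks whether the Chapter I average falls below that cut point.

```python
# Approximate group size below which further reductions showed larger
# effects, as described for the Glass et al. (1982) meta-analysis.
GLASS_THRESHOLD = 10

# Average instructional group sizes reported across all sample schools.
group_sizes = {
    "reading": {"regular": 17.6, "chapter1": 6.6},
    "math": {"regular": 20.2, "chapter1": 7.7},
}


def relative_reduction(regular: float, chapter1: float) -> float:
    """Share by which the Chapter I group is smaller than the regular group."""
    return (regular - chapter1) / regular


for subject, sizes in group_sizes.items():
    cut = relative_reduction(sizes["regular"], sizes["chapter1"])
    below = sizes["chapter1"] < GLASS_THRESHOLD
    print(f"{subject}: {cut:.1%} smaller, below threshold: {below}")
```

On these averages, Chapter I groups are a little over 60 percent smaller than regular groups in both subjects, and both averages sit below the ten-student mark.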

This finding is especially important and requires further interpretation. 
As meta-analyses have shown, the magnitude of reductions in group sizes 
between Chapter I and regular classrooms at schools in the sample was 
substantial enough to expect the reductions to affect student achievement 
positively. However, as these same meta-analyses demonstrate, group size 
reductions have larger effects the longer the period of time in which students 
experience such reductions. As Tables 7-1 through 7-3 show, most of the 
Chapter I students in this study could be expected to receive less than 100 
hours of Chapter I instruction over the course of a thirty-six-week academic 
year. But if students experienced longer periods of Chapter I instruction, it 
seems reasonable to expect that any positive effects of the reduced group 
sizes purchased by Chapter 1 funds would be enhanced. 
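
The 100-hour comparison can be checked against the modal elementary schedule with the same footnote arithmetic used in the tables. The Python sketch below is a rough illustration under stated assumptions: the names are invented for this example, and the two schedules are simply the low and high ends of the modal pattern described above (four sessions of thirty minutes and five sessions of forty minutes).

```python
WEEKS_PER_YEAR = 36     # academic year length given in the table footnotes
THRESHOLD_HOURS = 100   # duration cut point used by Glass and associates (1982)


def yearly_hours(sessions_per_week, minutes_per_session, weeks=WEEKS_PER_YEAR):
    """Yearly hours of service for a fixed weekly schedule."""
    return sessions_per_week * minutes_per_session * weeks / 60


def weekly_minutes_needed(target_hours=THRESHOLD_HOURS, weeks=WEEKS_PER_YEAR):
    """Weekly minutes required to reach the target number of yearly hours."""
    return target_hours * 60 / weeks


low = yearly_hours(4, 30)    # low end of the modal elementary pattern
high = yearly_hours(5, 40)   # high end of the modal elementary pattern
needed = weekly_minutes_needed()

print(f"{low:.0f} to {high:.0f} hours per year; {needed:.0f} minutes per week needed")
```

Under these assumptions a project needs roughly 167 minutes of service a week, for example five sessions of about 33 minutes, to reach 100 hours in a year. Schedules at the low end of the modal pattern fall well short of that.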

A final question investigated by this study was the extent to which 
Chapter 1 instruction offered students opportunities to practice "higher- 
order" skills. In general, the narrative records from classroom observations 
showed that Chapter I reading and math projects did not focus on these 
kinds of tasks. In math, Chapter I students in both elementary and 
secondary schools worked primarily on computational tasks involving basic 
arithmetic facts. Word problems were common, but they did not constitute 
the core of instruction, and little attempt was made to engage students in 
tasks that required the use of mathematical models to synthesize or evaluate 
ideas. In reading, the general pattern also was for Chapter I lessons to focus 
on lower-order tasks. With a few notable exceptions, students at all grade 
levels spent a good deal of time on worksheets that involved little reading of 
connected text. The low level of the Chapter I curriculum undoubtedly 
served a useful purpose by providing students with review and 
practice of basic skills, especially at the lower grades, but as students 
entered upper elementary school and passed into secondary schools, the 
continuing focus on basic reading skills appeared particularly incongruent 
with the reading tasks demanded in regular classrooms. This is unfortunate, 
for in schools where Chapter 1 classes did provide direct instruction in 
reading comprehension, the narrative records indicated that many Chapter I 
students were capable of performing higher-order tasks. Thus the low level 
of Chapter I reading instruction often prevented local projects from present- 
ing Chapter 1 students with challenging materials that extended and enriched 
learning. 



Design Characteristics and Quality of Chapter 1 Instruction 

As Tables 7-1 through 7-3 demonstrate, there was much variation around 
the modal patterns of instruction discussed above. An important purpose of 
this study was to investigate the extent to which this variation could be 
accounted for by differences in instructional design features at the various 
schools in the sample. A basic finding emerged from this analysis. Contrary 
to the assumption of much of the policy literature in compensatory educa- 
tion, there was little evidence in this study that instructional quality varied 
as a result of the particular service delivery model used by a local Chapter I 
project. Instead, other design features appeared more important. 

Consider the data on the effects of different service delivery models 
(e.g., pullout, in-class, replacement, add-on). Past research led us to be 
particularly interested in the question of whether pullout programs offered 
"inferior" instructional services. At the elementary level, there was no 
indication of this. On the whole, pullout projects allocated about the same 
amount of time to Chapter I instruction, provided the same small group size, 
and were characterized by the same amount of "direct" instruction as 
projects with in-class and replacement designs. A similar pattern was 
evident at the secondary level. 

Only the add-on designs observed at the elementary levels appeared to 
provide an advantage over alternative designs, and then only on one 
dimension of instructional quality: instructional time. In schools with pullout 
and in-class designs, it was common for Chapter I instruction to constitute 
roughly 30 to 40 percent of the total reading instruction received by Chapter 
I students. However, these delivery models rarely added more than ten to 
fifteen minutes to the daily time spent by students in reading and math, and 
this added time often came at the expense of time taken away from other 
academic subjects. In schools with replacement projects, Chapter I instruc- 
tion generally constituted 100 percent of the instruction received by students 
in reading or math, and thus these projects added little or no time to 
students' instructional days and resulted in little or no redistribution of time 
across academic subjects. Thus, overall it appeared that most Chapter I 
projects added no time to a student's instructional day and resulted only in a 
very modest redistribution of time across subjects. Only the add-on projects, 
which offered Chapter I instruction outside of the regular school schedule 
(before or after school), appeared to actually add instructional time in basic 
skills without taking time away from other subjects (see Rowan et al., 1986, 
chapter 6 for a more detailed analysis of these findings). 

Service delivery models also differed on another dimension: staffing 
patterns. However, this did not appear to markedly affect the quality of 
instruction received by Chapter I students, at least as measured by 
quantitative variables. For example, in-class projects in the sample were 
staffed almost exclusively by instructional aides, whereas pullout and 
replacement models were often staffed by a combination of resource 
teachers and aides. To examine whether there were differences in the type of 
instruction offered by various types of instructors, we compared the 
instructional formats of lessons taught by resource teachers and aides. The 
analysis revealed no consistent differences. In reading, aides provided 15 
percent more "direct" instruction (lecture/recitation); in math, aides pro- 
vided about 5 percent less "direct" instruction. The narrative field records 
suggested that resource teachers generally provided well-organized lessons, 
used effective teaching practices, and kept students motivated and engaged 
in lessons. There were also many excellent instructors among the aides 
observed in this study, but there was also more variation in quality across 
aides; a few aides presented confusing and wrong information to students or 
were unable to keep students motivated and engaged in lessons. On the 
whole, however, it did not appear that staffing patterns made a significant 
difference to the instruction received by Chapter I students. 

The field records were also used to evaluate the criticism that pullout 
instruction wastes time for pulled-out students and disrupts ongoing instruc- 
tion in the regular classroom. To examine this criticism, a comparison was 
made between five elementary schools where all observed Chapter I 
instruction was offered in a pullout setting with two elementary schools that 
used an in-class setting exclusively. In the two in-class projects, transition 
times to and from Chapter I lessons averaged 2.28 minutes and 1.47 minutes. 
Of the five pullout projects, three had average transition times of about 3.5 
minutes, one had an average time of 5.7 minutes, and the last had a transition 
time of 9.0 minutes. Clearly, the amount of time spent traveling to and from 
Chapter I services was less in the in-class projects, but only by one or two 
minutes in most cases. Transition times were lengthy only in schools where 
the Chapter I pullout rooms were at considerable distance from the regular 
classroom. Finally, the field records indicated that the movement of 
students into and out of classrooms was no more disruptive than movement 
within classrooms, and in elementary schools especially, students seemed 
quite accustomed to the movement and regrouping of students. 

Design decisions other than the choice of a particular service delivery 
model appeared to have larger effects on quality of instruction in Chapter I 
programs. For example, schools used different curricula, and this appeared 
to have marked effects on the content covered by students, especially 
opportunity to practice higher-order skills in reading and math. In fact, 
variables measuring content covered have been found to affect achievement 
in at least two previous studies of compensatory education, the Instructional 
Dimensions Study (Cooley and Leinhardt, 1980) and the Sustaining Effects 
Study (Carter, 1984). Thus curricular decisions, which were unrelated to the 
use of a particular service delivery model, appeared to have important 
effects on instructional quality. 

In addition, the adoption of curricula designed for self-paced instruc- 
tion through sequenced curriculum hierarchies appeared to have marked 
effects on the instructional treatments received by Chapter I students. This 
is because these kinds of curricula often resulted in a heavy reliance on 
student seatwork. As the Instructional Dimensions Study (Cooley and 
Leinhardt, 1980) demonstrated, the use of individualized, sequenced cur- 
ricula, in and of itself, does not appear to be related to student achievement. 
What is needed, apparently, is active instruction by teachers. In fact, this 
kind of "direct" instruction was found by the Sustaining Effects Study 
(Carter, 1984) to be related to improved instructional outcomes. 

Examples of Chapter I classrooms relying on individualized curricu- 
lum hierarchies and seatwork were particularly evident in the secondary 
schools in the sample. A particularly interesting case is Lakeview school 
(see the data in Table 7-3). At this school, in both reading and math classes, 
students entered a classroom with twelve to fifteen other students and 
immediately began working independently on the individualized curriculum 
packets. Although aides circulated and monitored student seatwork, the 
majority of actual instruction was offered to students by the resource 
teacher, who sat at a desk near the front of the room and conducted 
"staccato" recitations with students. As Table 7-3 shows, students at this 
school received only 1 to 3 percent "direct" instruction. In addition, about 
40 percent of student time was spent in management, as students waited for 
aides to check their assignments or for the resource teacher to meet with 
them. Although this case is extreme, Table 7-3 shows that most of the 
secondary schools in the sample operated this kind of individualized 
program and offered very little "direct" instruction to students. In all 
likelihood, such individualization was a convenient grouping strategy in the 
face of the wide variation in student achievement levels in secondary school 
Chapter I classrooms, but there is little evidence to support the efficacy of 
this kind of approach. 

The Relationship Between Chapter I and Regular Instruction 

A final issue addressed by this study was the relationship between Chapter I 
and regular instruction. Previous research and policy analyses have been 
critical of the Chapter I program on several counts related to this issue. The 
program has been criticized for adding little instructional time to participat- 
ing students' instruction and for taking time away from instruction in other 
subjects; the program also has been criticized for providing instruction that 
is incongruent with instruction received in the regular classroom (Walberg, 
1984; Johnston, Allington, and Afflerbach, 1985). 

The data on students' instructional days only partially confirmed the 
first criticism. As discussed above, Chapter I instruction in most schools 
occurred within the school day and thus added little time to a student's total 
daily instruction. At the same time, however, this pattern did not appear to 
cause students to miss large amounts of instruction in other academic 
subjects. An analysis of quantitative data suggested that, on average, 
students gained only about ten minutes of instruction in reading or math on a 
day when they received Chapter I services, and while this slight gain came 
at the expense of instruction in other academic subjects, it did not appear to 
result in radical redistribution of learning time across academic subjects. 
The qualitative data suggested an explanation for this finding. Most of the 
schools in the study scheduled Chapter I classes so that they would be 
minimally disruptive to students, and this usually involved offering Chapter 
I services while same-subject instruction was being offered in the regular 
classroom. The only exceptions to this general tendency were an extended 
pullout program in one elementary school, which pulled students out of their 
regular classrooms for two hours in the afternoon during a nine-week period 
of the school year, and several secondary school replacement projects in 
which Chapter I instruction replaced various electives in students' academic 
programs. Thus the data suggested that in most schools participation in the 
Chapter I program had little effect on the amount of time students spent in 
different academic subjects (see Rowan et al., 1986, chapter 6). 

Despite this consistent scheduling pattern, there were important differ- 
ences in the extent to which regular and Chapter I instructional programs 
were coordinated at sites in the study. In particular, the study found that the 
relationship between the content of lessons in the Chapter I and regular 
programs varied greatly across sites. In the final report of the study, a 
typology was developed to describe the relationship between the content of 
lesson assignments across programs in a school. In some schools, a 
"supportive" assignment pattern existed. In this pattern, Chapter I lesson 
assignments were explicitly designed to reinforce lesson content previously 
covered in the regular classroom. Other schools developed "alternative" 
assignment patterns in which Chapter I lessons bore little or no relationship 
to lessons in the regular program. Thus the data in this study did not confirm 
a simple generalization about the lack of congruence between Chapter I and 
regular instruction. In fact, over half of the elementary schools in the sample, 
and two of the seven secondary schools in the study, maintained a 
"supportive" assignment pattern that established congruent lesson content 
in the Chapter I and regular instructional programs (see Rowan et al., 1986, 
chapter 8 for a discussion of these findings). 

An important question is whether one type of assignment pattern is 
more effective than another in promoting student achievement. In this 
study, data on student success rates were used to provide insight into this 
question. An analysis of the narrative field records suggested that there was 
no simple relationship between assignment patterns and student success in 
lessons. For students who were performing near grade level, "supportive" 
Chapter I assignments which reinforced regular classroom instruction 
appeared effective in maintaining and promoting student success in the 
regular academic program. However, "alternative" assignment patterns 
appeared more appropriate for students who were performing well below 
grade level or who had instructional needs that were not addressed by 
regular curricula. Because these students usually required instruction that 
was far off the pace of that which occurred in the regular classroom, they 
usually did not experience much success during regular lessons and bene- 
fitted much more from the more appropriate instruction provided by the 
"alternative" assignments in the Chapter I program. Thus the data suggest 
that for the slowest students in a school, a Chapter I program that replaces 
grade level instruction may be the most appropriate, whereas for students 
near grade level, "supportive" assignments in Chapter I can sustain grade 
level performance. Despite the fact that no single assignment pattern 
appeared appropriate for all students, not a single Chapter I program in this 
study provided different assignment patterns to students with different 
instructional needs. 

Finally, it was found that formal procedures for coordinating Chapter I 
and regular instruction were necessary but not sufficient to the integration of 
the two instructional programs within a school. Formal policies about 
curriculum and evaluation, formal organization of school staff into teams or 
planning units that included Chapter I staff, and the formal scheduling of 
joint planning times for Chapter I and regular staff all facilitated coordi- 
nation of Chapter I and regular instruction. However, schools that showed 
the tightest coupling between Chapter I and regular instruction were those in 
which staff endorsed a norm of collegiality and had developed shared beliefs 
about instruction. 

CONCLUSIONS 



A major purpose of this study was to review the evidence on the quality of 
instruction in a variety of Chapter I schools and to formulate some 
suggestions for improvement. The great variety in Chapter I "treatments" 
across schools makes generalizations about program improvement very 
difficult. However, there was one uniform characteristic of Chapter I 
instruction in the schools in this study. All of the Chapter I projects offered 
instruction in small groups. It makes sense to assume that the results of past 
large-scale evaluations of the Chapter I program have been influenced by 
this trend, and that participation in smaller-sized Chapter I instructional 
groups has given students in the Chapter I "treatment" a small advantage 
over students not participating in Chapter I. At the same time, however, the 
small effects of the Chapter I program in macro evaluations make sense in 
light of this finding. Since Chapter I instruction usually accounts for less 
than 100 hours of learning time over the course of an academic year, the 
effects of reduced group size on student achievement should not be large. 

The data also suggest another reason why past evaluations have found 
weak effects of program participation on student achievement. In general, 
participation in Chapter I services had very little effect on the total amount 
of time students in this study spent in reading and math lessons. This was 
especially true of many replacement projects, and it was also true of most 
pullout and in-class projects. Instead of adding to the amount of time 
students spent in reading and math lessons, schools simply redistributed a 
fixed amount of instructional time across programs. This process would not 
give Chapter I students who were "slow learners" any additional time to 
learn basic skills. 

Apart from these general observations, however, it appears that 
Chapter I projects implemented instructional programs with very site- 
specific characteristics. This finding has a number of implications for how 
policymakers, researchers, and practitioners think about improving the 
program. The analyses reported here suggest: 

1. Policymakers and practitioners should recognize that improvement in 
the Chapter I program will occur on a site-by-site basis. Given the weak 
constraints of federal statutes and regulations and the variable imple- 
mentation of project design features at local sites, it appears unlikely 
that sweeping reforms can effect uniform changes in local instructional 
programs. 

2. Policymakers and practitioners should recognize that the adoption of a 
particular service delivery model (e.g., pullout or in-class) is not the 
major consideration in thinking about how to improve Chapter I 
instruction. Other variables, such as the amount of "direct" instruction 
or the nature of the curriculum, are more likely to affect the quality of 
instruction received by students. 

3. Policymakers and practitioners should consider how time can be better 
used in Chapter I projects. Instead of redistributing the fixed amount of 
time in students' daily schedules across different programs, Chapter I 
funds might be better used to purchase add-on services that increase the 
amount of time students spend in basic skills instruction. By placing 
Chapter I instruction outside the regular school day, these designs not 
only add to instructional time, but also prevent students from missing 
instruction in other academic subjects. 

4. Policymakers and practitioners need to give more careful attention to 
the curriculum linkage between Chapter I and regular instruction. The 
implementation of "supportive" or "alternative" lesson assignments 
should be done only after a careful assessment of whether or not 
students' needs can be met by the regular curriculum. When students' 
needs are unmet by the regular program, alternative instructional 
assignments may be required; when students can be maintained at grade 
level with minimal support, supportive assignments are appropriate. 

5. Chapter I projects at all levels should expose students to higher-order 
thinking skills, especially opportunities to read connected text and to 
apply mathematics to real world problems. Although the focus of 
Chapter I instruction on basic skills provides students with useful 
review and practice, as students become older this review does little to 
support student learning in the regular classroom curriculums. 

6. Chapter I projects in secondary schools should move away from 
self-paced instructional formats that rely on independent seatwork. Too 
often, secondary school Chapter I projects allowed the materials, rather 
than the instructor, to provide the instruction for Chapter I students. 

The implementation of these suggestions has the potential to increase 
the scope and quality of Chapter I instruction and could allow local projects 
to better meet the special educational needs of the students served. 

INSTRUCTIONAL SETTING 
AND OTHER DESIGN FEATURES 
OF COMPENSATORY 
EDUCATION PROGRAMS 

Francis X. Archambault, Jr. 

University of Connecticut 



The past decade has been a particularly fruitful one for research on effective 
educational programs. We have learned a great deal about factors that 
distinguish effective from not-so-effective schools. We have also uncovered 
a wealth of information about teaching practices that contribute to improved 
student performance. Much of what we have learned from the effective 
schools literature can be used in schools serving the academically disadvan- 
taged. In fact, such research has usually been conducted in this type of 
environment. Fortunately, some of the effective teaching research is also 
transportable to these schools and the classrooms within them. Nonetheless, 
much remains to be learned about the types of classroom practices and 
strategies that will lead to improved performance for compensatory educa- 
tion students. 

This chapter is concerned with what the literature says about the 
effects of certain design features of compensatory education programs. 
Specifically, it focuses on the effect of instructional setting (i.e., whether 
instruction is delivered in class or outside the regular classroom in a pullout 
setting), the size of the class and the types of groups in which instruction is 
offered, and the effectiveness of various types of instructors. Because the 
large majority of the dollars allocated to compensatory education provides 
remedial instruction in reading, language arts, and mathematics for elemen- 
tary school students, these content areas and grade levels are emphasized. 
Further, because most compensatory instruction is provided in a pullout 
setting, and since the choice of setting may influence other design features, 
this design feature is emphasized. 

THE OVERALL EFFECT OF COMPENSATORY 
EDUCATION 

In 1965, Congress passed the Elementary and Secondary Education Act 
(ESEA). Title I of this Act was designed to improve the educational 
opportunities of the poor and educationally disadvantaged by providing 
federal dollars to all states and, through them, to local school districts most 
in need. Given the proposed large investment in Title I and the concern of 
many congressional leaders that federal funds would not be used as 
intended, ESEA required that the Title I program be evaluated. These 
evaluations were the first systematic studies of the overall effect of 
compensatory education programs. However, even before these studies had 
begun, the Coleman Report conclusion that "schools bring little influence to 
bear on a child's achievement that is independent of his background and 
general social control" (Coleman, Campbell, Hobson, McPartland, Mood, 
Weinfeld, and York, 1966, p. 274) was being interpreted by many as an 
indictment of Title I (Stickney and Plunkett, 1982). 

Wargo, Tallmadge, Michaels, Lipe, and Morris (1972) provide the best 
review of the early evaluations of Title I. They concluded that there was 
little evidence to support the contention that Title I had a positive impact on 
participating students. Stickney and Plunkett (1982) have argued that the 
disappointing findings of these early evaluations may have been different 
had the scores for all participants not been lumped together. That is, some 
children did appear to benefit from Title I, but, when their scores were 
combined with those of students who did not, the positive effect was 
neutralized. McLaughlin (1975) reached a different conclusion. She argued 
that the initial attempts to evaluate Title I were unsuccessful because of the 
newness of the program, the diverse nature of the instructional components, 
and problems in obtaining cooperation between the federally sponsored 
evaluators and local school personnel. 

Carter (1984) has reported on a more recent study of Title I, one that he 
claims is the largest study of elementary education ever conducted. This 
research, known as the Sustaining Effects Study, focused on about 1,100 to 
3,000 Title I students, about 1,300 to 2,000 needy students, and about 1,700 
to 2,500 regular students in each of the grades 1 through 6. Title I students 
were reported by their school to be receiving some Title I services, needy 
students were reported by their teachers as being in need of Title I services but not 
receiving them, and regular students were reported as not needing Title I 
services and not receiving them. The major results of this investigation were 
the following: 

1. Compared to needy students, Title I students showed significant gains in 
mathematics for each of the six grade levels. 

2. Compared to needy students, Title I students showed significant gains in 
reading for grades 1 through 3 but not for grades 4 through 6. 

3. The largest gains in reading and mathematics occurred in the first grade. 

4. The rate of gain for Title I students was at least equal to that for regular 
students, whereas the rate of gain for needy students who did not 
receive Title I was not as great as that of either Title I students or regular 
students. 

5. Title I services were found to be effective for students who were only 
moderately disadvantaged (near average in achievement) but they did 
not improve the relative achievement of the most disadvantaged part of 
the school population. 

6. By the time students reached junior high school, there was no evidence 
of sustained or delayed effects of Title I. 

How should these results be interpreted? Does compensatory education 
have a sufficiently positive impact on students to justify the enormous 
amount of money (over $40 billion) that has been allocated to it? 

Wang (1980) reported that although compensatory education has had a 
positive impact on achievement growth, the effects of compensatory educa- 
tion efforts are not large. In fact, she concluded that the "evidence for 
positive effects of special instruction on achievement growth is sparse" (p. 
8). Hallinan (1986) concludes that the results of the Sustaining Effects Study 
are disappointing. She argues that "the desired outcome of a program of this 
magnitude would be steady, significant achievement gains that are sustained 
throughout a student's school career" (p. 4). Cooley (1981) concludes that 
"on balance, Title I is not producing the kind of achievement impact that 
people had hoped it would" (p. 300). 

Stickney and Plunkett (1982) offer a different conclusion for the 
Sustaining Effects findings, which they contend is influenced by data from 
the National Assessment of Educational Progress (NAEP). This latter 
research (NAEP, 1981) found that between 1970 and 1980 black nine-year- 
olds gained 9.9 percentage points on their reading assessment measure 
whereas whites improved their achievement by only 2.8 percentage points. 
NAEP also reported significantly greater gains between 1970 and 1980 for 
students in Title I eligible schools for all three grade levels tested. According 
to NAEP "these significant changes and the overall pattern of a narrowing 
gap for most population groups at all ages strongly suggest that students in 
Title I schools are improving at a faster rate than students in non-Title I 
schools" (cited in Stickney and Plunkett, 1982, p. 381). Stickney and 
Plunkett caution that since the NAEP conclusions are based on correlations, 
one cannot argue that Title I has caused greater pupil achievement. 
However, they do conclude that the Sustaining Effects and NAEP studies 
"have finally reported what a host of smaller scale evaluations have 
suggested for years: Title I is associated with greater than expected pupil 
gains" (p. 380). Forbes (1985) is even more sanguine about the positive 
impact of compensatory education programs. In a relatively recent Phi Delta 
Kappan article Forbes concluded the following: 

Many things that happened in the Seventies could have dramatically affected 
student achievement in reading, among them: Title I, desegregation, sensitivity 
to serving students who had been underserved in the past, movements to 
improve public education in the southeastern states, and a focus on social 
concerns. Of these, any could have had a major impact on student perfor- 
mance. But the movement that stands out above all others during the Seventies 
is the effort to better educate the economically disadvantaged. Title I and all 
the associated national and state efforts probably played a central role in the 
improvement of reading skills among those students who historically per- 
formed less well (p. 544). 

In 1981, the Education Consolidation and Improvement Act (ECIA) was enacted, 
thereby replacing Title I with Chapter I. Although we have little systematic 
data on the effect of this relatively new Act, Carter (1984) claims that the 
results "for Title I will apply equally well to the new Chapter I compen- 
satory education program" (p. 4). Cooley (1981), however, contends that the 
search for overall effects of Title I (now Chapter I) is an inappropriate goal. 
Rather, he suggests that researchers and evaluators should stop trying to 
justify this federal funding program and focus instead on ways in which 
compensatory education practices can be improved. Thus, Cooley and 
others are arguing that we should not be asking whether programs like 
Chapter I make a difference, but rather how we can adjust or align the 
programs so that they do make a difference. As noted above, we have 
learned a great deal in the recent past about the characteristics and features 
of educational programs that work. It is now time to adopt certain features 
that have been shown to be effective in compensatory education environ- 
ments and to test others that have shown promise in noncompensatory 
settings. Thus we now turn to a discussion of design features that must be 
considered in the delivery of compensatory educational programs. 

SETTING FOR INSTRUCTION 

The first design feature we consider is the setting or location for instruction. 
Compensatory education can be delivered within the regular classroom, 
outside the regular classroom, or partially inside and partially outside the 
regular classroom. Instruction delivered outside the regular classroom has 
been given the name pullout instruction, since recipients are removed or 
pulled out of their regular classroom and sent to another location. Usually, 
the location is within the same school building, and the instruction delivered 
there is usually provided by an instructional specialist working with smaller 
groups of children than there were in the classroom from which the Chapter 
I student was removed. Instruction delivered within the regular classroom is 
called in-class or mainstreamed instruction. One approach to mainstreaming 
is for the regular teacher to instruct Chapter I students while Chapter I 
personnel, either a specialist teacher or an aide, provide supplementary 
assistance. Another form of mainstreaming occurs when a Chapter I staff 
person instructs Chapter I students while the regular teacher works with the 
other students in the class. A third, albeit less popular strategy, is known as 
the replacement model, which provides in one classroom all the services a 
Title I student receives in a given subject. Finally, there are add-on 
programs, which result from an extension of the school year (i.e., summer 
programs), the school week, or the school day. 

What do we know about the popularity of these approaches? Carter 
(1984) reported that compensatory instruction was typically provided in a 
pullout setting, as did Vanecko, Ames, and Archambault (1980), Archam- 
bault and St. Pierre (1980), Kennedy (1978), and Stonehill and Anderson 
(1982). NIE (1977) reported that almost 75 percent of the students in Title I 
reading programs received their instruction in a pullout setting. About 41 
percent of the compensatory language arts instruction was also delivered in 
a pullout setting, as was about 45 percent of the mathematics instruction. 
NIE also reported that about 24 percent of compensatory education students 
receive their regular instruction in groups comprised of only CE students. 
Glass and Smith (1977) claim that when the NIE pullout figures are corrected 
to eliminate pupils in 100 percent Title I-eligible classrooms who do not need 
to be pulled out, the pullout rates in all other classrooms rise to 84 percent 
for reading, 54 percent for mathematics, and 50 percent for language arts. 
Glass and Smith go on to argue that when one considers further that pupils 
might be "pulled out" for one of these subjects and not the other, it is 
plausible to say that in classes not 100% Title I eligible "the practice of 'pull 
out' for compensatory teaching is nearly universal" (p. 2). 

More recently, Advanced Technology, Inc. (1983) reported that over 
90 percent of the districts in their sample employed a pullout design 
exclusively or in combination with an in-class model; only 30 percent of the 
districts used an in-class design. Advanced Technology also reported, 
however, that pullout programs were being used less frequently than they 
had been in the past and that the use of in-class designs was on the rise, 
particularly in larger school districts (i.e., over 10,000 students). They found 
much less frequent use of the replacement model. Although this chapter will 
not address the use of add-on programs, partly because the data on them are
particularly skimpy, it is interesting to note that summer programs were
available in over half (51 percent) of the 5,010 schools surveyed in the
Sustaining Effects Study. As reported by Heyns (1986), larger schools are
more likely to conduct summer programs than smaller schools. Interested
readers are referred to the Heyns paper for more details on summer
programs. It should be noted, however, that she concludes that such
programs are not unequivocally successful.

Impact of Setting on Achievement 

Given the predominant use of the pullout and in-class modes of instruction, 
this discussion of the impact of setting on achievement will focus on these 
two models. In general, the literature leads to the following conclusions: (1) 
in some cases, compensatory education in a pullout setting results in higher 
student achievement than in-class instruction; in other instances, in-class 
instruction has a more positive effect; and in still others, and these appear to
be the majority of cases, the setting for instruction has no significant effect
on student achievement; (2) the findings on the cost effectiveness of the
approaches are also mixed; and (3) the type of instruction delivered within a
particular setting is more important than the setting itself. 

One of the first large-scale investigations of the effect of instructional 
setting for compensatory education students was the third-year evaluation 
of the Emergency School Aid Act (Coulson, Ozenne, Hanes, Bradford,
Doherty, Duck, and Hemenway, 1977). In one part of this evaluation,
residual gain scores on individual student reading and math tests were 
regressed on nineteen variables that were thought to be related to achieve- 
ment. One of the nineteen variables was the proportion of reading or math 
instruction received in a pullout environment. The 8,319 students participat- 
ing in this research were drawn from two types of compensatory education 
programs and from grades 3, 4, and 5. Since both reading and mathematics 
scores were analyzed, a total of twelve regression equations were derived, 
one for each program, grade level, and content area. For eight of these 
analyses, no significant differences were found for the pullout variable. In 
the other four instances, three of which were concerned with reading, 
pullout was found to have a significant negative effect on achievement. 
Leinhardt and Palley (1982) interpret these findings to mean that "a change
from receiving no pullout instruction in reading to receiving one half reading
instruction in pullout would be associated with a small reduction in the
dependent measure" (p. 569). They also argue that no matter what the
dependent measure was, the effect was very small. 

Perhaps even more important than the small effect size is the strong 
possibility that pullout was associated with less gain because (1) less able
students tended to get pullout instruction and (2) the pretest did not control 
adequately for such differences. Further, and as suggested by Coulson and 
colleagues (1977), schools were more likely to use pullout instruction when 
they had students with more severe problems. 
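
The confounding described here can be illustrated with a small simulation. The sketch below is not a reanalysis of the ESAA data; every number in it (sample size, noise levels, the assignment rule) is an assumption chosen for illustration. It gives pullout a true effect of zero, assigns the less able students to pullout, and measures ability with a noisy pretest. The posttest is then regressed on the pretest and a pullout indicator.

```python
import random

def simulate_pullout_coefficient(n=5000, seed=0):
    """Return (pretest_coef, pullout_coef) from an OLS fit on simulated data.

    Hypothetical setup: pullout has NO true effect, but less able students
    are the ones assigned to it and the pretest measures ability with error.
    """
    rng = random.Random(seed)
    pretest, pullout, posttest = [], [], []
    for _ in range(n):
        ability = rng.gauss(0.0, 1.0)
        pretest.append(ability + rng.gauss(0.0, 1.0))   # unreliable pretest
        pullout.append(1.0 if ability < 0.0 else 0.0)   # weaker students pulled out
        posttest.append(ability + rng.gauss(0.0, 0.5))  # true pullout effect is zero

    def mean(xs):
        return sum(xs) / len(xs)

    m1, m2, my = mean(pretest), mean(pullout), mean(posttest)
    # Centered sums of squares and cross-products for the normal equations.
    s11 = sum((a - m1) ** 2 for a in pretest)
    s22 = sum((b - m2) ** 2 for b in pullout)
    s12 = sum((a - m1) * (b - m2) for a, b in zip(pretest, pullout))
    s1y = sum((a - m1) * (y - my) for a, y in zip(pretest, posttest))
    s2y = sum((b - m2) * (y - my) for b, y in zip(pullout, posttest))
    det = s11 * s22 - s12 ** 2
    pretest_coef = (s22 * s1y - s12 * s2y) / det
    pullout_coef = (s11 * s2y - s12 * s1y) / det
    return pretest_coef, pullout_coef

if __name__ == "__main__":
    b_pre, b_pull = simulate_pullout_coefficient()
    print(f"pretest coefficient: {b_pre:.2f}")
    print(f"pullout coefficient: {b_pull:.2f}  (true effect is 0)")
```

Under these assumptions the estimated pullout coefficient comes out negative even though pullout does nothing in the simulation, which is the kind of artifact an inadequate pretest adjustment can produce.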

The Instructional Dimensions Study (IDS) conducted for NIE by the
Learning Research and Development Center (LRDC) at the University of
Pittsburgh also was concerned with the effect of pullout. This research
focused on reading and mathematics instruction in first and third grade and
used classrooms as the basic unit of analysis. Here the variable "setting"
was scaled as a composite of the time students were in supplemental
instruction outside the regular classroom and the number of children in a 
classroom that were pulled out for instruction. NIE (1977) reported that the 
results uncovered in this research differed by content area and grade level. 
First-grade students receiving instruction in the regular classroom (i.e., the 
in-class setting) made significantly larger gains in reading and mathematics 
than those in pullout settings. However, at the third-grade level, setting had 
no significant effect on reading achievement and pullout was associated with 
larger achievement gains in mathematics. NIE interpreted these results to 
mean that neither setting is consistently associated with greater instructional 
effectiveness. 

Frechtling and Hammond (1978) reported on a reanalysis of the NIE 
findings. These researchers, who were part of the team monitoring the IDS 
study for NIE, noted that rechecking and cleaning the data changed the
previously described picture slightly. As before, in-class instruction was
found to have a positive effect on achievement at grade 1 for both reading
and mathematics. At grade 3, however, in-class instruction was found to be
more effective for reading (there was no difference previously) whereas no
significant difference was found for instructional setting when mathematics
was considered (pullout was more effective previously) (see Figure 8-1).
Frechtling and Hammond (1978) argued that despite these changes in the
previously reported findings they would "be uncomfortable concluding from
the IDS data that districts should use only the inclass approach" (p. 9). They
offered several reasons for their position: (1) the IDS findings were based on
data from districts that were "special" and not nationally representative; (2)
they had a gut feeling that mainstreaming (i.e., in-class instruction) may pose
problems for many teachers and simply not be possible in some instances;
and (3) they were unwilling to generalize from a single study. On this last
point, they argued that "the IDS, despite all its virtues, is only one study,
and convergent evidence from other work is not yet strong enough" (p. 9). 

Cooley and Leinhardt (1980) and Leinhardt and Palley (1982) also
report on the IDS study and, to make matters somewhat more confusing,
their findings are different from those reported by NIE. Cooley and
Leinhardt (1980) reported that setting was related to student achievement 
gain only in third-grade reading and that more time in pullout was associated 
with less gain. As can be seen in Figure 8-1, this is quite different from 
NIE's interpretation of the findings. 

FIGURE 8-1. Various Perspectives on the Pullout Findings of the Instructional
Dimensions Study*

Key: I* = in-class students outperform pullout students; P* = pullout students
outperform in-class students; ND = no difference.

*The Instructional Dimensions Study was commissioned and monitored by NIE and
conducted by the Learning Research and Development Center at the University of
Pittsburgh.

Leinhardt and Palley (1982) discuss a reanalysis of the IDS data in
which they at least partially addressed a criticism by Glass and Smith (1977)
that the pretest measure used was too difficult and thus did very little to
adjust for initial group differences. In these analyses they included only
children who had received pullout instruction and they treated setting as a
student level variable rather than the classroom level variable employed in
their previous analysis. They also changed the nature of the setting variable,
redefining it as the number of minutes a child was in pullout. This reanalysis
led them to conclude the following:



Reanalyzing the data with only those children that received pullout changed 
the picture dramatically. In the analysis with the non-pullout students, there 
was a negative relationship between the time spent in pullout and posttest in all 
cells. When the non-pullout students were excluded from the analysis, the 
results completely changed. There was a positive significant relationship 
between pullout and posttest in all cases except third grade reading (p. 568). 

By way of explaining differences across the two analyses, Leinhardt and 
Palley (1982) offer the following:

The most plausible interpretation of these results is that the nonpullout 
students had more academic knowledge than the pullout students, that is, the
poorer students were chosen for pullout. Hence, pullout was associated with
less achievement because it had less able students and not because it was an
inferior practice. Pretest did not adequately adjust for all the initial differences.
In this case, the underadjustment would be particularly faulty for the first grade
since (as Glass and Smith, 1977, pointed out) the pretest was not a good
measure of initial abilities. Examining the results with the nonpullout students
removed, we see the following: Pullout is still associated with less gain in third
grade reading but not in the other three cases. Overall, the more time a student
spent in the segregated pullout class, the greater the gain. It seems likely that
effective practices were being implemented in these settings. It cannot be
determined whether students who were pulled out suffered some initial losses
due to having been removed from the regular class. We can say that once
pulled out, they increasingly gained from more pullout instruction (p. 568).

Prior to these reanalyses, Glass and Smith (1977) looked at the results
of both the IDS study, as reported by NIE, and the ESAA evaluation and 
found that the ESAA study provided a better data base and more believable 
evidence for assessing the effects of pullout. Their review of the ESAA 
findings led them to conclude that there was a consistent negative relation- 
ship between the percentage of time pupils spend in the pullout setting and 
their mathematics and reading achievement. They also concluded that 
"pupils pulled out of their regular classrooms would have to receive 
remarkably effective compensatory programs to offset the potential risks 
incurred. In our opinion, the 'pulled out' pupil is placed in moderate
jeopardy of being dysfunctionally labelled, of missing opportunities for peer 
tutoring and role modeling, and of being segregated from pupils of different 
ethnic groups" (p. 41). Finally, they concluded that the "pulled out" 
procedure per se had no clear academic or social benefits and may, in fact, 
be detrimental to pupils' progress and adjustment to school. 

From their paper, it appears that Glass and Smith were willing to base
their conclusions about pullout on the results of a single study. Obviously, 
other studies influenced their decision, as suggested by their review of the 
impact of variables such as ability grouping, labeling, peer tutoring, racial 
desegregation, and mainstreaming the handicapped. But it appears that the
ESAA findings were the sine qua non of their argument. With these findings
in hand they could argue that pullout is not only ineffective in improving
achievement but that it also produces harmful labeling of students, it serves
as a means for resegregating students, and so forth. But is pullout truly
ineffective in improving achievement? The reanalyses of the IDS data argue
that it is not, and, based on these findings, it appears that Glass and Smith
(1977) may have been too negatively disposed to the impact of pullout, at
least on achievement. But, at the same time, the reanalyzed IDS data do not
suggest that pullout is the best method for improving the achievement of 
compensatory education students. Rather, they seem to suggest that pullout 
is one method that should be considered. Cooley (1981) captures the essence
of these mixed reactions: "No doubt there are schools in which the pullout
practice is being done well, but there are certainly schools in which they may 
be doing more harm than good" (p. 300). 

More recently, a few studies have looked at the impact of pullout on 
achievement, but, with one exception, these studies are much narrower in
scope and in significance than those reported above. Consequently, they do 
little more than add fuel to the setting debate fire. Nonetheless, they are 
worthy of consideration, and we turn now to a brief discussion of them. 

The Sustaining Effects Study did not directly address the question of 
the impact of pullout on achievement. However, Carter (1984) did conclude 
that pullout was the most typical mode of instruction in Title I programs and 
that Title I did make a difference in achievement, if only for moderately 
academically disadvantaged students. It could be argued that Carter thus 
indirectly concluded that pullout has some effect on achievement. However, 
and as is increasingly the case in the literature, Carter was more interested in 
describing the effect that particular classroom practices had on achievement 
than in discussing the effect of setting. Those variables that his research 
found to be important were opportunity to learn, lower student/staff ratio, 
allocation of teachers' time, classroom management practices, source of
student off-task behavior, supervision of students, and coordination of 
instruction. He also found years of teaching experience and the instructional 
leadership of the principal to be important variables. 

Yap (1983) also addressed the question of pullout in his secondary 
analysis of data acquired over a three-year period (1978 to 1981) throughout 
the state of Hawaii. In this research, Yap compared pullout programs with 
both in-class and combined in-class/pullout programs. These comparisons 
were performed separately for reading and mathematics, and for reading 
only by year, using analysis of covariance procedures in which an NCE gain 
score was the dependent variable, type of program was the independent 
variable, and pretest NCE and per pupil cost were covariates. Yap offers no 
explanation for his selection of per pupil cost as a covariate. Moreover, 
although he reports sizeable differences in per pupil cost (e.g., almost $350 
more per pupil in 1978-79 for pullout versus in-class instruction), he does 
not describe how this covariate affects the dependent measure, if at all, and
it is not possible to determine from the results whether it did. Yap also does 
not explain why he chose to use the more unreliable gain score as the 
dependent variable, particularly given the use of the NCE pretest score as a 
covariate. 

Yap reported that there were some 300 "projects" involved in his 
research. That is, 287 schools were involved in the reading analysis and 26 
were involved in the mathematics analysis. For reading, 73 schools were 
involved in 1978-79, 109 schools in 1979-80, and 103 schools in 1980-81, 
and in all years over 60 percent of them were pullout schools. Yap bases his
major conclusions on school level comparisons. This means that NCEs had
to be aggregated across grades. It also means that only one type of setting
must have been employed at a given school. Yap offers no guidance as to 
whether this did occur. 

Clearly, Yap's study had some weaknesses, and, consequently, his 
results are somewhat suspect. Nonetheless, it is interesting to note that Yap 
found no negative achievement effects resulting from pullout. In fact, his 
school-level analyses reveal a trend indicating the superiority of pullout 
programs over both in-class and combined pullout/in-class programs. Given 
small sample sizes and missing data problems, Yap reported that he was 
unable to ascertain through a formal analysis of variance or covariance 
whether grade level interacted with setting. However, an inspection of 
unadjusted and adjusted means indicated to Yap (no means were presented) 
that, although such effects were not substantial, there may be grade level 
differences. Taken together, these results led Yap to conclude that in terms 
of achievement gain, students who received Chapter 1 instruction in the 
pullout setting are likely to perform as well as, if not better than, their 
counterparts in other settings. He also concluded that, despite some 
drawbacks, the pullout setting was a viable option for providing services to 
Chapter 1 youngsters. 

In a 1985 paper, Yap also argued for the cost effectiveness of the 
pullout approach. Using the data described above, with all of the inherent 
weaknesses ascribed to them, he found that pullout produced a unit of NCE 
gain for every $81 spent. Corresponding figures for in-class and combination 
programs were $84 and $86, respectively. These findings disagree with 
Fitzgerald and Hunt (1985). In this study of about one-third of the school 
districts in Minnesota, pullout and in-class settings were found to be equally 
effective in producing student achievement gains. However, the pullout 
setting was found to be more costly and to show a trend toward lower cost 
effectiveness than in-class instruction. Fitzgerald and Hunt argue that these 
increased costs might be due to the use of licensed teachers in pullout 
settings and teacher aides for supplemental in-class instruction. Given the
scope of this study (141 randomly selected districts) and the quality of the 
methods used throughout, it is possible that the Fitzgerald and Hunt findings 
on cost effectiveness are closer to reality than those reported by Yap. 
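
To put Yap's figures in perspective, the cost of one NCE unit under each design can be expressed relative to pullout. The short calculation below uses only the three dollar figures quoted above; the function name is ours, and the calculation is an illustration rather than part of Yap's analysis.

```python
def relative_gap(baseline, other):
    """Fractional amount by which `other` exceeds `baseline`."""
    return (other - baseline) / baseline

# Dollars spent per unit of NCE gain, as reported by Yap (1985).
cost_per_nce = {"pullout": 81.0, "in-class": 84.0, "combination": 86.0}

if __name__ == "__main__":
    base = cost_per_nce["pullout"]
    for design, cost in cost_per_nce.items():
        gap = relative_gap(base, cost)
        print(f"{design}: ${cost:.0f} per NCE unit, {gap:+.1%} vs. pullout")
```

On these figures, in-class and combination programs cost roughly 4 and 6 percent more per NCE unit than pullout.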

Nearine and Pecheone's (1984) study of the Intensive Reading Instruc-
tion Team (IRIT) Program, which was implemented in Hartford, Connec-
ticut, and validated by the state of Connecticut and the JDRP, also reported
positive effects for pullout. This study included 698 students in grades 3
through 6 whose pre-entry reading performance on the Metropolitan
Achievement Test fell below the 23rd percentile. The comparison group
contained 124 third-grade through sixth-grade students who also scored
below the 23rd percentile but who were not selected for the program (i.e., a
matched sample). Since students entered this ten-week intensive program at
different times during the school year, comparisons could also be made
across three "cycles." Thus posttest performance of students completing
earlier cycles could be compared with pretest performance of students
entering a later cycle. Nearine and Pecheone reported gains of at least six
NCEs across the various reading and language arts analyses. They also
reported that students who completed an IRIT cycle performed higher than
students entering the next cycle. Finally, they found that the gains in reading
were sustained throughout the school year (i.e., beyond the time of IRIT
instruction). However, they did not find significant posttest differences
between IRIT students and those in the comparison group, who outper-
formed the IRIT students on the pretest, although not significantly.

Knight (1979a, 1979b, and 1979c) also reported positive effects of
pullout in studies of several schools in New York City. In each of these 
studies the actual posttest scores of pullout participants were compared 
against the scores students would be expected or predicted to receive 
without intervention. In all of the comparisons performed across the three 
studies, actual posttest performance exceeded predicted levels. Further, all
but six of the more than twenty-five comparisons across various grade levels
and content areas were statistically significant. However, the weaknesses of
the research design and alpha problems associated with multiple compari- 
sons render these findings somewhat suspect. 
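
The "alpha problem" can be made concrete. If each comparison is tested at a nominal significance level and the comparisons are treated as independent (an assumption made here for illustration; comparisons drawn from the same schools are unlikely to be fully independent), the chance of at least one spurious significant result grows quickly with the number of tests. The sketch below assumes a conventional level of .05, which the source does not state.

```python
def familywise_error_rate(alpha, k):
    """Probability of at least one false positive in k independent tests."""
    return 1.0 - (1.0 - alpha) ** k

if __name__ == "__main__":
    for k in (1, 5, 25):
        rate = familywise_error_rate(0.05, k)
        print(f"{k:>2} tests at alpha = .05: {rate:.3f}")
```

With twenty-five tests the rate is about .72 under these assumptions, so a handful of the reported significant results could reflect chance alone.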

Finally, Madhere (1981) reported that pullout was more effective than
in-class instruction in improving student achievement. Madhere also argued
that pullout appeared to be most effective in an environment where the Title 
I pupils constitute a majority of the students in the school. He contended 
that in such a context, the impact of labeling a student in need of assistance, 
a negative consequence according to the critics of pullout, may be mean- 
ingless. 

Doss and Holley (1982), on the other hand, reported that pullout
programs were not effective. They argued that compensatory programs
conducted during the regular day inevitably supplanted regular instruction.
They also argued that one of the reasons for the ineffectiveness of pullout
programs was the regular classroom teacher's decreased sense of responsi-
bility for the special program student. Based on Glass and Smith's (1979)
meta-analysis of class size and achievement, they argued that reducing class
size to fifteen and delivering instruction within the regular classroom would
not only redress some of the problems occurring within pullout programs,
but would also improve student performance. Since this approach was
permissible under the "schoolwide projects" provision of Title I in force at
that time, such a program was implemented on a pilot basis in 1980-81 in
two Austin, Texas, elementary schools. Analyses were conducted on
reading and mathematics achievement for students in grades 2 through 6 in
those schools and in certain Title I schools using pullout programs. From
these analyses, Doss and Holley concluded that in most schoolwide projects
students outscored Title I regular students of the same pretest level and that
the impact of the schoolwide projects was generally uniform across all
levels of the pretest. On average, the advantage of the schoolwide projects
across grades was 2.1 months in reading, 2.5 months in language arts, and
2.2 months in mathematics. However, these gains were purchased at such
great expense ("lowering the PTR [pupil/teacher ratio] from 25-to-1 to
15-to-1 would increase personnel costs by 67% in the schools and grades
where applied" [Doss and Holley, 1982, p. 10]) that the schoolwide project
model was not adopted in other Austin schools. Levine and Stark (1982) also
report that the schoolwide projects implemented in the Los Angeles Unified
School District were so poorly received that they were discontinued, partly 
because they were very expensive and partly because there were questions 
about how well they worked. 
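
The 67 percent figure quoted from Doss and Holley follows from simple arithmetic if one assumes that enrollment is fixed and that personnel costs scale with the number of teachers (both assumptions are ours, made for illustration). A minimal sketch:

```python
def staffing_increase(old_ptr, new_ptr):
    """Fractional increase in teachers needed when the pupil/teacher ratio drops.

    For a fixed enrollment, teachers needed = enrollment / PTR, so the
    ratio of new to old staffing is old_ptr / new_ptr.
    """
    return old_ptr / new_ptr - 1.0

if __name__ == "__main__":
    # Moving from 25 pupils per teacher to 15 pupils per teacher.
    print(f"{staffing_increase(25, 15):.0%}")  # prints 67%
```

The same function shows how steeply staffing needs rise as the target ratio falls, which is why class-size reduction on this scale is expensive.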

What, then, can we say about the effect of pullout on achievement? 
Based on the research that has been conducted to date, we cannot conclude 
with confidence either that pullout is more effective than in-class instruction 
or that the opposite is true. We can conclude, however, that the achieve- 
ment findings provide little support for the overwhelming, and in some 
locations almost exclusive, use of the pullout model. Rather, the findings 
suggest that both pullout and in-class instruction can be effective, given 
appropriate circumstances, and, as reported by Carter (1984), Leinhardt and 
Palley (1982), Tobias (1982), and others, it is likely that these circumstances,
particularly the type and quality of the instruction delivered within an
instructional setting, are more important than the setting itself. Despite this
general sense, however, it must be reported that pullout programs have been 
opposed for reasons other than their impact on achievement. 

Other Effects of Instructional Setting 

In addition to achievement findings, the literature also includes an increasing 
amount of other information on the effects of in-class versus pullout 
programs. Included among the long list of issues that have been addressed 
are the following:

1. The stigma associated with receiving Chapter 1 instruction, particularly 
in a pullout setting

2. The levels of communication, cooperation, and coordination possible in 
the two settings 

3. The levels of teacher autonomy and control afforded by the settings 

4. The amount and type of content lost when compensatory instruction is 
received, particularly in pullout programs 

5. Class size shifts and resultant gains 

6. Grouping patterns, individualization, and segregation in in-class and
pullout settings

7. The relationship among peers and the opportunity for peer tutoring

8. The types of instructors and their roles in different settings.

The first four of these issues will be addressed in this section. The remaining 
four issues will be addressed in sections that follow. In either case, much of
what we know about these issues is derived from relatively small-scale 
observational studies, studies of relatively modest scope reporting on the 
results of interviews with program students, teachers, and administrators, 
and somewhat narrow surveys of the perception of these and other groups. 
Moreover, as was the case with the review of the impact of setting on 
achievement, the data that are available are generally equivocal.

Stigma. Leinhardt and Palley (1982) reported that despite contrary
research findings, special educators strongly believe that special placement
makes the target child feel rejected rather than rescued. Likewise, many
educators believe that compensatory education students feel different if they
are labeled as Chapter 1-eligible and are pulled out of their regular
classrooms for special instruction (e.g., Leinhardt and Palley, 1982; Shuy,
1978; Glass and Smith, 1977). Some argue, on the other hand, that 
stigmatization occurs in mainstream programs as well (Noddings, 1978). On 
this score, at least one author has suggested that stigmatization may be 
greater in the mainstream setting since in this environment compensatory 
education students are both singled out and remediated in front of their 
classmates (Hayes, 1983). From a slightly different perspective, it has also 
been argued that pullout is an effective instructional strategy in an environ- 
ment where Chapter 1 students constitute a majority, and that in such 
settings labeling may be meaningless (Madhere, 1981).

Even though the research evidence is inconclusive, it appears reason-
able to argue that stigmatization occurs in both in-class and pullout settings:
in the in-class setting because students receive extra instruction in front of
their peers, and in the pullout setting because students are easily identifiable.
It also appears that it is not the setting itself that aggravates the feeling of 
being different, but the sensitivity or insensitivity of teachers operating 
within the setting (Noddings, 1978). Kennedy's (1978) advice on this issue 
seems particularly compelling: 

Compensatory education students are less likely to be subjected to labeling and
its negative effects in environments where teachers actively encourage chil-
dren's respect for and appreciation of a variety of human differences; in
environments where similarities between learning tasks and materials are
emphasized; and in environments where Title I and non-Title I children
frequently move, in an organized way, to other parts of the building or 
classroom to receive special instruction (p. 35). 






Communication, Cooperation, and Coordination. One of the most
frequently cited advantages of the in-class setting is the increased commu-
nication, cooperation, and coordination among regular classroom and com-
pensatory education instructors that it affords. Advocates of in-class in-
struction argue that because specialist teachers or aides hired with
compensatory education funds provide their instruction in the same
classroom as the regular teacher, there will be increased interaction and
cooperation in planning for instruction (Bean and Eichelberger, 1985;
Kennedy, 1978; Hayes, 1983; Neumann, 1985), better integration of re-
medial and regular instructional strategies (Allington, 1986; Bean and Ei-
chelberger, 1985; Kaestle and Smith, 1982), better opportunities for individ-
ualization and small-group instruction (Bean and Eichelberger, 1985), and a
better flow of information on student progress (Harnischfeger, 1980). There
will also be fewer disruptions caused by students moving in and out of the
regular classroom (Hayes, 1983), and thus more continuity in instruction, as
well as an increased opportunity for regular classroom teachers to observe 
other, perhaps more effective, approaches for working with compensatory 
education students (Kennedy, 1978). 

Allington (1986) has offered some particularly powerful comments on 
one of the consequences of pullout, namely the fragmentation of instruction. 
He suggests that pullout remedial reading instruction is typically indepen- 
dent from the reading instruction offered in the regular classroom and that it 
generally employs different materials and teaching methods. He argues 
further that neither the classroom teacher nor the compensatory education 
specialist is likely to know what type of instruction the other is offering. 
Consequently, he claims that few remedial students actually receive instruc-
tion that supplements their core curriculum; rather, they are taught by
classroom and remedial teachers who express different beliefs about student
needs, offer different objectives as targets for instruction, and use reading
materials that represent different models of the reading process. As a result,
he argues, students who are in need of remediation are offered instruction
that seems likely to increase their confusion rather than reduce it. Allington
also acknowledges, however, that pullout programs can offer instruction
that is congruent with that provided in the regular classroom (i.e., same or
similar material, same reading strategy used, and same reading skill taught), 
and, therefore, that the lack of congruence may be related to factors other 
than the instructional setting itself. 

Despite this indictment of pullout programs, it appears that in-class 
programs also present problems, some of which are the same as those
prevalent in pullout settings. Bean and Eichelberger (1985), for example,
noted differences in classroom teachers' and specialists' teaching styles and
instructional strategies when a large city school district changed from a
pullout program to one in which specialists worked in the regular classroom.
Moreover, although teachers in this new program reported increased
interaction and cooperation, they also reported problems with scheduling
and inadequate joint planning time; reading specialists, on the other hand,
reported that there was a lack of teacher interaction, that they weren't sure
what their role was in the classroom, that they had insufficient materials,
and that there was inadequate management. Bean and Eichelberger (1985)
concluded that these problems were due to two teachers sharing the same 
room and that, despite the in-class arrangement, team teaching was difficult. 

Vasquez and Nuttall (1983) reached similar conclusions. These re-
searchers also argued that it is the classroom teacher, not the specialist, who
is forced to schedule activities that will accommodate another class in the 
same room and that the coordination between the regular and compensatory 
program is still accomplished outside of the regular classroom, before 
school, during recess, during lunch, or after school. They argued further that 
even though the materials used in the regular and remedial programs are 
more likely to be similar in in-class programs, pullout programs use a greater 
variety of instructional materials. Specialist teachers working in the regular 
classrooms apparently find it difficult to transport books and other parapher- 
nalia from classroom to classroom (Bean and Eichelberger agree), find it 
harder to use audiovisual materials because they would be distracting to the 
regular classroom teacher, and, for that same reason, avoid materials that 
require recitations or other "out-loud" work. Despite his enthusiasm for the 
in-class approach, Neumann (1985) concluded that where this type of setting 
is used specialists must be highly organized, must be flexible enough to work 
in several environments, and must have the diplomacy and personality 
needed to work jointly with colleagues. 

Taken together, these results suggest that even though in-class pro- 
grams offer the opportunity for increased communication, cooperation, and 
instructional continuity, they do not ensure that they will occur. Perhaps, as 
suggested by Kennedy (1978), the key to good coordination between 
compensatory and regular instruction is not the setting for instruction but 
rather the quality of the joint planning that occurs, the amount of decision- 
making power given to teachers and specialists by their respective adminis- 
trators, and the flexibility in scheduling. Whether these characteristics occur 
more frequently in in-class settings remains to be seen. 

Teacher Autonomy and Control. Critics of the pullout model have 
argued that when students receive remedial instruction outside the regular 
classroom neither the regular teacher nor the compensatory education 
specialist is willing to accept full responsibility for the students' progress 
(Allington, 1986; Doss and Holley, 1982). If they so desire, classroom 
teachers can claim that less than acceptable progress is attributable to 
shortcomings of either the student or the compensatory instruction; 
compensatory education specialists, on the other hand, can attribute such a 
result to either the student or inadequate regular classroom instruction. To 
avoid such academic finger-pointing, some have argued that the adoption of 
the in-class model would allow the teachers who share the same space to 
share this responsibility or the administrators coordinating the program to 
decide on whose shoulders the responsibility should rest. Despite the wis- 
dom of this argument, it appears that the adoption of an in-class arrangement 
will again not ensure the expected outcome. Bean and Eichelberger (1985), 
for example, reported that both teachers and specialists had problems with 
two instructors in one room and that the problem of leadership or control 
was a real issue. Vasquez and Nuttall (1983) reached a similar conclusion, as 
did Shuy (1978), who found that some of the teachers in the Instructional 
Dimensions Study favored pullout because it gave them greater autonomy. 
On the other hand, Shuy also reported that some teachers favored the 
in-class approach because it gave them more control. Unfortunately, the 
literature does not allow one to resolve this apparent conflict. 

Content Lost. It was argued earlier in this chapter that students 
receiving compensatory instruction in a particular content area (e.g., 
reading) do so at the expense of regular instruction in that same content. 
Harnischfeger (1980) concluded that pullout pupils "often receive less 
reading time than the rest of the class because the pulling out— going some 
place and coming back— also entails loss of instructional time for students 
who need more" (p. 4). Regardless of whether this is true, and there is some 
evidence to suggest that the total time allocated to basic skills instruction is 
greater for compensatory education students than for their noncompensa- 
tory education peers (Archambault and St. Pierre, 1980), the implication is 
that some approach other than or in addition to pullout would be more 
effective. 

Harnischfeger's solution would be to lengthen the school day for 
compensatory education students (i.e., an add-on approach). Others argue 
that providing in-class instruction would solve the problem because then the 
time lost by traveling from place to place would be retained. Allington 
(1986), for example, suggests that as much as fifteen minutes per day, or 
forty hours per academic year, might be saved in this manner. However, 
neither Allington nor anyone else that this reviewer has been able to uncover 
provides direct evidence on what would actually be done with the newly 
found time were compensatory instruction to be provided in the classroom. 
Would the interruptions prevalent in in-class programs (Hayes, 1983) prove 
to be less troublesome than the disruption associated with pullout programs? 
Would the presence of more than one teacher in a single classroom result in 
greater confusion than occurs in pullout programs, particularly when such 
programs are adopted on a wholesale basis? Additional research is needed 
before we can determine with any confidence whether in-class compensatory 
education programs result in more or less time allocated to instruction. 
Additional research is also needed to determine whether such differences 
in allocated time are accompanied by differences in the amount of time 
that students in various settings actually engage or actively participate in 
instruction. 
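
Allington's two figures can be checked against each other with simple arithmetic. The sketch below uses only the fifteen minutes per day and forty hours per year quoted above; the 180-day school year used for comparison is an assumption introduced here for illustration, not a figure taken from the literature reviewed.

```python
# Figures quoted from Allington (1986) in the text above.
MINUTES_SAVED_PER_DAY = 15
HOURS_SAVED_PER_YEAR = 40

# Assumption for illustration only: a 180-day school year.
ASSUMED_SCHOOL_DAYS = 180


def implied_school_days(minutes_per_day: float, hours_per_year: float) -> float:
    """Number of instructional days implied by a daily and a yearly saving."""
    return hours_per_year * 60 / minutes_per_day


def yearly_hours(minutes_per_day: float, school_days: int) -> float:
    """Hours saved per year for a given daily saving and school-year length."""
    return minutes_per_day * school_days / 60


if __name__ == "__main__":
    # Days of instruction implied by Allington's two figures taken together.
    print(implied_school_days(MINUTES_SAVED_PER_DAY, HOURS_SAVED_PER_YEAR))  # 160.0
    # Yearly saving if the assumed 180-day year applied instead.
    print(yearly_hours(MINUTES_SAVED_PER_DAY, ASSUMED_SCHOOL_DAYS))  # 45.0
```

Under these assumptions, the forty-hour estimate corresponds to about 160 days of compensatory instruction per year. The calculation says nothing about how the recovered time would actually be used, which is the open question raised above.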

CLASS SIZE, INDIVIDUALIZATION, AND 
GROUPING 

Compensatory instruction is delivered in smaller classes and groups than 
regular classroom instruction. It is also more likely than regular instruction 
to incorporate components of individualized instructional programs. Pullout 
instruction, which comprises the larger portion of compensatory instruction, 
is delivered in groups that are more homogeneous in ability than those in the 
classrooms from which compensatory students are removed. Do the reduc- 
tions in class and group size and the increase in individualization of 
instruction have a positive effect on students? Is homogeneous ability 
grouping preferred to heterogeneous ability grouping? We turn now to a 
discussion of these issues. 

Glass and Smith (1979) reported on the results of a meta-analysis of 
seventy-seven studies involving 725 comparisons of larger and smaller 
classes. They found that 60 percent of the achievement comparisons favored 
smaller classes and that the effect was particularly strong when the classes 
contained fifteen or fewer pupils. They also found that this relationship did 
not change significantly for students of different ability levels. On the other 
hand, a review of twenty-four studies conducted by the Educational 
Research Service (1978) concluded that research provided no clearcut 
guidelines for an optimum class size. The ERS review also concluded, 
however, that pupils with lower ability tend to benefit from smaller classes 
more than pupils of average ability and that smaller classes can positively 
affect the scholastic achievement of economically or socially disadvantaged 
pupils. Cooper (1986), in his comprehensive review of the literature, 
summarizes the effect of class size on achievement as follows: 

Both the (Glass and Smith) meta-analysis and the ERS review concluded that 
smaller class sizes benefit the population of students served by Chapter I 
programs. Glass and Smith did so by finding the benefit of small classes held for 
students regardless of their intelligence level. ERS found the benefit for low 
ability or economically disadvantaged children only. Further, the typical size 
of compensatory education instructional groups falls within the range about 
which there appears to be some agreement. It will be recalled that from 
one-third to all of Title I students in the districts sampled by the (Title I) 
Demonstration Study (Archambault and St. Pierre, 1979) received compensatory 
instruction in groups of one to five students. All the reviewers agreed 
that this is the end of the class size curve where effects are most dramatic and 
dependable (p. 9). 

Hedges and Stock (1983) cited two problems with Glass and Smith's 
estimation procedures. First, they argued that Glass and Smith overestimated 
effect sizes when study Ns were small. Second, they argued that the variance 
of a study's effect size estimate was not based on its sample size, as it should 
have been. Hedges and Stock (1983) thus reanalyzed the same class size data 
taking these problems into account. As might be expected, they found a 
smaller effect size than Glass and Smith. However, they also found that 
smaller classes still led to higher expected achievement and that the shape of 
the curve expressing the relationship between class size and achievement 
found by Glass and Smith remained unchanged. 
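
The two statistical points raised by Hedges and Stock can be illustrated with the small-sample correction and the variance approximation that are commonly used for standardized mean differences. This is a sketch of those general formulas, not a reproduction of the Hedges and Stock reanalysis; the function names, the raw effect size, and the sample sizes in the usage example are hypothetical.

```python
def small_sample_correction(n_small_class: int, n_large_class: int) -> float:
    """Approximate correction factor J = 1 - 3 / (4 * df - 1).

    df is the pooled degrees of freedom, n_small_class + n_large_class - 2.
    J is below 1, so it shrinks the raw effect size, and shrinks it most
    when the study is small.
    """
    df = n_small_class + n_large_class - 2
    if df < 1:
        raise ValueError("need at least three students across the two groups")
    return 1 - 3 / (4 * df - 1)


def corrected_effect_size(raw_d: float, n_small_class: int, n_large_class: int) -> float:
    """Raw standardized mean difference multiplied by the correction factor."""
    return small_sample_correction(n_small_class, n_large_class) * raw_d


def effect_size_variance(d: float, n_small_class: int, n_large_class: int) -> float:
    """Large-sample approximation to the sampling variance of d.

    The variance depends directly on the two sample sizes, which is the
    second point attributed to Hedges and Stock in the text.
    """
    total = n_small_class + n_large_class
    return total / (n_small_class * n_large_class) + d ** 2 / (2 * total)


if __name__ == "__main__":
    raw_d = 0.5  # hypothetical raw effect size
    for n in (5, 50):  # hypothetical students per group
        print(
            n,
            round(corrected_effect_size(raw_d, n, n), 3),
            round(effect_size_variance(raw_d, n, n), 4),
        )
```

With five students per group the correction removes roughly a tenth of the raw effect and the variance is about ten times that of a study with fifty students per group, which is why small studies call for both a smaller estimate and less weight.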

Slavin (1984) has criticized both the Glass and Smith study and the 
Hedges and Stock reanalysis. He argues that the Glass and Smith results 
were heavily influenced by studies of tutoring (i.e., one-to-one instruction, 
or a class size of one). This claim is based on his review of fourteen studies 
listed by Glass and Smith in which students were randomly assigned to large 
or small groups. Fourteen of the thirty contrasts in these studies involved 
comparisons of classes with a median size of twenty-three to those with 
class sizes of one to five, and ten of these involved classes with one student. 
The median effect size for these comparisons was 0.545. Nine involved 
comparisons of classes with a median size of 35 to classes ranging in size 
from fourteen to seventeen pupils. The median effect size for these studies 
was 0.08. Finally, for comparisons involving classes with more than twenty 
students (i.e., median larger classes of thirty-seven and median smaller 
classes of twenty-six), the median effect size was 0.04. Thus, Slavin 
concludes that the effects of tutoring are positive but that "there is nothing 
in the Glass et al. meta-analysis to contradict an assertion that class size 
makes no meaningful difference in achievement unless the smaller class has 
no more than three students" (p. 11). 
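
One common way to read the median effect sizes Slavin reports is to convert each into the percentile rank that the average student in the smaller class would hold within the larger-class distribution. The sketch below assumes normally distributed scores with equal variance in both groups; that assumption is made here for illustration and is not stated in the studies themselves.

```python
from statistics import NormalDist

# Median effect sizes reported by Slavin (1984), keyed by the class-size contrast.
SLAVIN_MEDIAN_EFFECT_SIZES = {
    "median of 23 vs. 1 to 5 pupils": 0.545,
    "median of 35 vs. 14 to 17 pupils": 0.08,
    "median of 37 vs. median of 26 pupils": 0.04,
}


def percentile_of_average_student(effect_size: float) -> float:
    """Percentile rank, within the larger-class distribution, of the average
    smaller-class student, assuming normal scores with equal variance."""
    return 100 * NormalDist().cdf(effect_size)


if __name__ == "__main__":
    for contrast, effect_size in SLAVIN_MEDIAN_EFFECT_SIZES.items():
        rank = percentile_of_average_student(effect_size)
        print(f"{contrast}: ES = {effect_size}, percentile = {rank:.1f}")
```

Under these assumptions the three contrasts correspond to roughly the 71st, 53rd, and 52nd percentiles, which makes concrete how sharply the benefit falls off once the smaller class has more than a handful of students.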

Slavin's criticism of Hedges and Stock (1983) focuses on their study 
selection procedures. That is, although he agrees with Hedges and Stock's 
conclusion about overestimated effect sizes, he believes that Hedges and 
Stock's reanalysis produced the same general conclusion as Glass and Smith 
because they chose to include the same studies as the earlier researchers. 

It appears, then, that for the total population the relationship between 
class size and achievement is not yet fully understood. For compensatory 
education students, however, there is some evidence to suggest that smaller 
class sizes may be beneficial. This conclusion is based on the ERS finding 
that pupils with lower ability tend to benefit from smaller classes. It is also 
based on Cooper's conclusion that the most dramatic effects of class size are 
at the end of the class size curve, namely classes of five students or fewer, the 
size of the group in which Chapter I students are most likely to receive 
compensatory instruction. 

The Glass and Smith study also looked at the effect of class size on 
variables other than achievement. They concluded that teacher morale is 
higher, that their attitude toward pupils is more positive, and that their 
satisfaction with their own performance is greater in smaller classes. They 
also found positive effects of smaller classes on "student attitude," which 
included attitudes toward the teacher, school, and themselves (i.e., self- 
concept), as well as positive effects on student motivation. Cooper (1986) 
summarizes these other results of the Glass and Smith study: 

This category of effects [student attitudes] demonstrated a difference of 29 
percentile ranks between students in classes of size 40 and size 10. Another 
group of effects involving measures of student participation showed a differ- 
ence between classes of 10 and 40 students equal to 23 percentile ranks. 
Finally, a group of variables measuring different types of student misbehaviors 
also favored smaller classes, but the number of comparisons was too small to 
estimate a percentile rank difference (p. 11). 
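
The percentile-rank differences Cooper quotes can be translated back into standard deviation units for comparison with effect sizes reported elsewhere in this chapter. The sketch below assumes that each difference is measured from the 50th percentile of the larger class and that scores are normally distributed; both assumptions are introduced here for illustration and are not stated by Cooper.

```python
from statistics import NormalDist


def effect_size_from_percentile_gain(gain_in_ranks: float) -> float:
    """Standard-deviation difference implied by a gain in percentile ranks,
    measured upward from the 50th percentile of a normal distribution."""
    if not 0 <= gain_in_ranks < 50:
        raise ValueError("gain must be at least 0 and less than 50 percentile ranks")
    return NormalDist().inv_cdf(0.5 + gain_in_ranks / 100)


if __name__ == "__main__":
    # Differences between classes of 40 and classes of 10, as quoted by Cooper (1986).
    for label, gain in (("student attitudes", 29), ("student participation", 23)):
        print(f"{label}: {gain} ranks = {effect_size_from_percentile_gain(gain):.2f} SD")
```

Under these assumptions the two differences correspond to about 0.8 and 0.6 standard deviations, respectively.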

Individualization 

Since the average class sizes for compensatory instruction in reading, 
language arts, and mathematics are all below fourteen and the average size 
of the regular classroom is twenty-seven (Archambault and St. Pierre, 1979; 
Carter, 1984; NIE, 1977), it would appear from the above that smaller class 
sizes would be preferred. Moreover, if students in these smaller classes are 
also receiving more individualized instruction, as Glass and Smith (1980) say 
they do, then it would be logical to conclude that class sizes for compen- 
satory education students, whether the instruction is compensatory or not, 
should be small (i.e., at least less than fifteen), and that individualized 
instruction should be emphasized. Before we leap to this conclusion, 
however, we should review what other researchers have to say. 

The Class Size and Instruction Program (CSIP) research of Cahen, 
Filby, McCutcheon, and Kyle (1983), which included four second-grade 
classes selected from an inner-city school in California and a rural school in 
Virginia, investigated what happens in classrooms when class size is 
reduced. These researchers concluded that smaller classes made discipline 
easier and that teachers in them spent more time teaching and less time 
policing. They also found that students paid closer attention in smaller 
classes and that group discussions involved more of the students in the 
group. Finally, the researchers concluded that the effect of participation 
may be most pronounced for low achievers because "in a small group, 
where control is perceived to be easier, the teacher may feel she or he can 
take time to draw all students into the lesson" rather than "rely on 
volunteers or high ability students in order to keep things moving along" 
(cited in Cooper, 1986, p. 13). 

The Class Size and Instruction Program as well as the Sustaining 
Effects Study (Carter, 1984) reported on behavior management issues 
associated with the sizes of classes and groups within them. The Sustaining 
Effects Study found that the smaller instructional groups and lower student- 
to-instructor ratios associated with Title I programs were associated with 
more student on-task behavior, less teacher time in behavioral management, 
a more harmonious class environment, and a higher quality of cognitive 
monitoring, task monitoring, and organization of activities. The CSIP 
researchers reached the same general conclusions. 

CSIP also looked at the relationship between class size and individual- 
ization. Although the CSIP researchers recognized that smaller classes 
provided the opportunity for greater individualization of instruction, they 
found that smaller classes did not lead to dramatic increases in the amount of 
individualization. Instead, they found that group instruction continued and 
the curriculum remained the same. Harnischfeger (1980) concluded that, 
even when class sizes are reduced drastically, many teachers do not 
significantly change their teaching strategies. She suggests that the only 
consequence of smaller class sizes is smaller subgroups but not fewer 
groups, and, on the average, no increased teacher/pupil interaction. 
Harnischfeger goes on to argue that instead of working with small 
groups one at a time while the rest of the class is on its own, teachers 
should consider whole-class instruction. If they do, Harnischfeger argues, 
their students will benefit by receiving more direct instruction. 

Despite the evidence provided by the CSIP researchers, one might 
argue that the findings are suspect because of the small sample of schools 
and classrooms (i.e., two schools and four classrooms involved in the 
research) and that individualization does occur more frequently in smaller 
classes. Suppose this were true. What do we know about the impact of 
individualization on student outcome measures? 

Cooley and Leinhardt (1980) reported they did not find any clear 
evidence for the superiority of individualized instruction over other methods 
of compensatory education. Their review of the literature also suggested 
that this finding is consistent with other attempts to establish the effective- 
ness of individualization. Brophy (1986) has argued that the problem is not 
with the conceptual model of individualization, which calls for beginning 
where students are and moving them along at their own pace. Rather, the 
problem is that: 



Individualized instruction in the typical classroom shifts a great deal of 
responsibility for planning and managing learning from the teacher to the 
students themselves and shifts responsibility for carrying the content to the 
students from the teacher to the materials. This is workable and may even have 
certain advantages when the teacher is continually available to provide close 
supervision and immediate help when needed, but it does not work well when 
students must function on their own for extended periods of time, trying to 
learn through interacting with the curriculum materials without much guidance 
or help from the teacher (pp. 13-14). 

Brophy does state, however, that the individualized training package 
approach can be used effectively in special classrooms with small student/ 
teacher ratios. Basing his conclusion on the results of a study by Crawford 
(1983), Brophy indicates that under such conditions "teachers can move the 
students through curriculum at a faster pace, can provide more tutorial and 
individualized instruction, and can assign more difficult work because they 
are able to monitor everyone's progress and provide immediate help when 
needed" (p. 14). Nonetheless, Brophy's review underscores the role of 
active instruction from the teacher in producing achievement gains: 

Such instruction can be provided in tutorial form in special classes with small 
student/teacher ratios, and it can be provided to small groups in classrooms 
where the teacher has developed appropriate assignments and installed effec- 
tive seatwork management procedures so that students are engaged in worth- 
while academic activities in between their small group lessons from the 
teacher. The individualized approach is not feasible in the typical classroom, 
however, and even the small group approach is likely to strike most teachers as 
more trouble than it is worth unless they have an aide available to supervise 
seatwork while they teach small group lessons (or unless they believe that the 
class is so heterogeneous that they are forced to provide differentiated 
instruction to small homogeneous groups). Consequently, most teachers will 
opt for the traditional whole-class instruction/recitation/seatwork method as 
their primary approach to instruction (p. 15). 

What does Brophy mean by active teaching? Active teaching connotes 
frequent lessons (whole class or small group, depending on grade level and 
subject matter) in which the teacher presents information and develops 
concepts through lecture and demonstration, elaborates this information in 
the feedback given following responses to recitation or discussion questions, 
prepares the students for followup assignments by giving instructions and 
going through practice examples, monitors progress on assignments, and 
follows up with appropriate feedback and reteaching when necessary. There 
is a great deal of teacher talk, but most of it is academic rather than 
procedural or managerial, and much of it involves asking questions and 
giving feedback rather than extended lecturing. 



Grouping 

The issue of grouping for instruction has been alluded to several times in the 
preceding discussion, but to this point we have said little or nothing about 
how and why groups are formed and the effects of different grouping 
patterns. In general, ability grouping refers to the practice by which students 
in more heterogeneous classes or groups are arranged into more homogeneous 
classes or groups for all or a portion of their instruction. Most U.S. 
schools follow this practice and most teachers overwhelmingly support it, 
probably because they believe that homogeneous groups are easier to teach 
than heterogeneous groups. Dividing students according to ability allows 
teachers to increase the pace and level of instruction for high achievers and 
provide more repetition, review, and remediation for low achievers. Ability 
grouping is also supposed to provide a spur to high achievers by making 
them work harder to achieve, and to place success within the grasp of low 
achievers, who are protected from having to compete with their more able 
agemates (Atkinson and O'Connor, 1963). Critics argue that ability grouping 
confers unnecessary distinction on those in the fast track while placing a 
stigma on those in slow groups (Kulik and Kulik, 1982). They also contend 
that being placed in a low-achieving class or track has damaging psychologi- 
cal effects on students (Oakes, 1985; Rosenbaum, 1980; Schafer and Olexa, 
1971). 

Slavin (1986), in his "best evidence synthesis" of the literature on 
ability grouping in the elementary school, carefully distinguishes within- 
class from between-class ability grouping. Within-class ability grouping 
refers to the assignment of students to more homogeneous groups within a 
heterogeneous class arrangement. The most common form of within-class 
grouping is the use of reading groups, where teachers assign students to one 
of a small number of groups on the basis of reading level. These groups work 
on different materials at rates unique to their needs and abilities. Other 
forms of within-class ability grouping may be found in mathematics, where 
the teacher presents a lesson to the class as a whole, and afterwards, while 
the students are working problems, the teacher provides enrichment or 
extension to a high-achieving group, remediation or reexploration to lower 
achievers, and something in between to average achievers. According to 
Slavin, group-paced mastery learning (Bloom, 1976) may be seen as one 
form of flexible within-class grouping. Individualized or continuous- 
progress instruction may be seen as extreme forms of ability grouping. 

Slavin (1986) also contends that between-class groups take many 
forms. In fact, he describes the following seven types: 

1. Ability-Grouped Class Assignment: Students are assigned on the basis 
of ability or achievement to one self-contained class (usually at the 
elementary level) or to one class that moves together from teacher to 
teacher, as in block scheduling in junior high schools. 

2. Curriculum Tracking: A special form of ability-grouped class assignment 
unique to the secondary level in which students are assigned by 
ability or achievement to tracks, such as college preparatory, general, or 
vocational. Students may take all or only some of their courses within 
these tracks. 

3. Regrouping for Reading or Mathematics (Ability Grouping for Selected 
Subjects): Students are assigned to heterogeneous homeroom classes for 
part or most of the day, but are "regrouped" according to achievement 
level for one or more subjects. 

4. Joplin Plan: A special form of regrouping for reading in which students 
are assigned to heterogeneous groups for most of the day but are 
regrouped for reading across grade levels (Floyd, 1954). For example, a 
reading class at the fifth-grade, first-semester reading level might include 
high-achieving fourth graders, average-achieving fifth graders, and low- 
achieving sixth graders. Reading group assignments are frequently 
reviewed, so that students may be reassigned to a different reading class 
if performance warrants it. 

5. Nongraded Plans: In its broadest application, grade-level designations 
are entirely removed, and students are placed in flexible groups ac- 
cording to their performance level, not their age (Goodlad and 
Anderson, 1963). Students move from group to group and complete the 
curriculum at their own pace so that some students might complete the 
primary grades, for example, in two years while others would require 
four years. 

6. Special Classes for High Achievers: Plans in which gifted, talented, or 
otherwise superior students are assigned to a special class for all or part 
of their school day, while other students remain in relatively heteroge- 
neous classes. 

7. Special Classes for Low Achievers: The assignment of students with 
learning problems to special or remedial classes for part or all of their 
school day. 

Dreeban (1984) suggests that factors other than students' ability or 
past achievement also affect their assignment to groups. Included among 
these additional factors are the distribution of students' aptitudes or 
achievement in the school or classroom, organization constraints within the 
school or classroom, and student management and discipline. With regard to 
assignment within classes, teachers also consider the number, size, and 
diversity of students within groups when assigning other students to the 
group. Generally, teachers form at least three groups, and groups formed on 
the basis of ability may vary widely on other dimensions. Nonetheless, 
instruction is geared to the level of ability attributed to the group. 

With regard to within-class grouping, a recent review of the literature 
by Wilkinson (1986) has concluded that students in low-ability reading 
groups fare poorly: 

Students in low groups receive less time to read in comparison with students in 
high groups; oral reading errors made by lows are more frequently corrected 
than those made by highs (i.e., they are interrupted more frequently); lows are 
given more instruction in decoding and teacher prompts are more focused on 
phonics for lows than for highs; lows are frequently less attentive than highs 
(Allington, 1980; Eder, 1981, 1982, 1983, 1984; Rist, 1970; Stern & Shavelson, 
1981). Eder (1981) argues that these practices result in lower reading-test 
performance and lower oral-reading grades for the low ability groups. The poor 
achievement of the lows is thus the result of low expectations by the teachers 
and/or the poor quality of teaching that they receive. At first glance, these 
studies suggest that the differential treatment received by the low ability 
students is the culprit, the cause of their poor reading achievement (pp. 12-13). 

Wilkinson continues: 

The implications for low achieving students seems clear: Homogeneous ability 
grouping is detrimental to learning for students assigned to low groups (Eash, 
1961; Persell, 1977). These studies reveal mechanisms through which the 
negative effect of assignment to a low ability group occurs. There are 
differences in the instructional processes and learning environments of differ- 
ent ability groups. Teachers and students interact in different ways across 
ability groups. We are compelled to agree with Hallinan's (1984, p. 233) 
conclusion that: 'At present, the great majority of studies demonstrate that 
assignment to a low track or ability group places students at a disadvantage in 
terms of learning opportunities' (p. 13). 

Wilkinson also concluded that low-ability students seem to benefit in 
achievement by being placed in small, heterogeneous ability groups (Swing 
and Peterson, 1982) and that these groups can provide environments that are 
positive for learning (Cohen, 1984; Webb, 1982, 1985; Webb and Kenderski, 
1984; Peterson, Wilkinson, Spinelli, and Swing, 1984). 

Slavin's (1986) review of the literature on within-class ability grouping 
reached a different conclusion. Basing this portion of his review exclusively 
on research in mathematics (because no reading studies met stated criteria 
for inclusion; see Slavin, 1986, pp. 15-16 for a discussion of these criteria), 
Slavin found that every one of the eight studies he located favored 
homogeneous within-class grouping, though not always significantly. Five of 
the studies also broke down achievement effects by student ability levels, 
but there was no clear pattern of results favoring ability grouping for one or 
another subgroup. On this score, Slavin concluded the following: 

Every subgroup gained more in classes using within-class ability grouping than 
in control (ungrouped) treatments. However, it is interesting to note that the 
median effect size for low achievers (ES = +.65) was higher than that for 
average (ES = +.27) or high achievers (ES = .41) (p. 84). 

Slavin acknowledges that this evidence is surprising in light of several 
studies of ability grouping in reading suggesting that students in low reading 
groups experience a lower quality of instruction than do those in higher 
groups. 

How can we explain the discrepant findings from the Wilkinson and 
Slavin reviews? One possible explanation is that Wilkinson draws heavily on 
studies in reading, whereas Slavin focuses only on mathematics. More 
important than this, however, is the fact that much of the research cited by 
Wilkinson deals with observations that students in low reading or math 
groups learn less than students in higher groups, not less than other 
low-achieving students in contrasting arrangements. Thus, according to 
Slavin, it is hardly surprising that high and low achievers differ and that their 
teachers' behavior differs accordingly. Slavin (1986) argues further that: 

Comparisons of achievement gains in high and low reading groups are found to 
show an advantage of being in the high group because high achievers learn 
more rapidly than low achievers, and unless measures used to control for initial 
ability are perfectly reliable and perfectly predictive of later reading achieve- 
ment, assignment to the high reading group will appear to lead to higher 
achievement (see Reichardt, 1979) (p. 58). 

What, then, can we conclude about the effect of within-class ability 
grouping? At present, the best evidence in this area comes from Slavin's 
review, and, based on this, it seems safe to conclude that at least in 
mathematics and on achievement grounds, within-class ability grouping 
should be supported for all students, including low achievers. However, and 
despite the results presented by Wilkinson, it does not appear reasonable to 
conclude at this time that either homogeneous or heterogeneous within-class 
ability grouping is preferred for delivering reading instruction to low-ability 
students. As suggested by Slavin, such a conclusion can be reached only 
when experimental comparisons of grouped and ungrouped reading classes 
have been performed. 

Turning now to between-class ability grouping, Glass and Smith (1977) 
called the research literature on ability grouping "a confusing welter of 
contradictory, tendentious, antiquated and dubious studies" (p. 32). None- 
theless, drawing from the reviews of Miller and Otto (1930), Billet (1932), 
Cornell (1936), Ekstrom (1959), Eash (1961), and Goldberg, Passow, and 
Justman (1966), Glass and Smith (1977) concluded that ability grouping is not 
an important variable in itself, and that it is worth considering only as it 
relates to or potentiates other changes in instructional activities. They also 
concluded that "the accumulated research literature of several decades 
shows such an unstable pattern of what takes place between teachers and 
pupils in homogeneous as opposed to heterogeneous classrooms that no 
generalizations are possible (or useful) about the effects of ability grouping" 
(pp. 34-35). 

Kulik and Kulik (1982) conducted a more recent meta-analysis of the 
effects of ability grouping on secondary school students. They found that in 
thirty-six of the fifty-one studies looking at achievement, the performance of 
students in grouped classes was better than the performance of students in 
ungrouped classes; in fourteen other studies, the performance was better in 
ungrouped classes; and in one study, there was no difference. A total of ten 
of the comparisons were statistically significant, eight of which favored 
grouping and two of which favored what Kulik and Kulik called "conven- 
tional instruction." The effect size was small, however, averaging about 
one-tenth of a standard deviation across studies. In the four programs 
designed especially for academically deficient students the effects were even 
smaller. In fact, they were near zero, indicating that students learned as 
much in mixed ability classrooms as they did in homogeneous classrooms. 
Kulik and Kulik did find, however, that homogeneous grouping was 
effective for high achievers. 

Slavin (1984) criticized the Kulik and Kulik (1982) review on the 
grounds that many of the studies they included used matching or other 
nonrandom procedures to attempt to equate groups on initial ability, and 
that this procedure would result in selection bias. Moreover, Slavin argued 
that in studies of high achievers, this bias should favor homogeneous 
assignment for gifted students, as was found by Kulik and Kulik. Slavin also 
argued that since the mean effect size for studies using random assignment is 
zero, Kulik and Kulik's claim of positive effects of ability grouping rests 
entirely on nonrandomized studies. 

In his own review of the ability grouping literature, Slavin (1986) 
argues that research clearly does not support the assignment of students to 
self-contained classes according to ability, nor does it support the conclu- 
sion that ability grouping is beneficial for high-ability students and detrimen- 
tal for low-ability students, as suggested by Eash (1961), Esposito (1973), 
and Begle (1975). The literature does suggest, however, that cross-grade 
tutoring and comprehensive nongraded plans can be effective. Particularly 
noteworthy in this regard are the fourteen studies reviewed by Slavin using 
Joplin or Joplin-like plans. For eleven of these studies the Joplin plan classes 
achieved more than the control classes; for the remaining three studies no 
differences were found. In addition, the median effect size for these studies 
was found to be about 0.45. The results for the comprehensive nongraded 
plans are less consistent than for the Joplin-like plans, but the median effect 
size is still moderately positive (ES = .29). Based on this, Slavin concludes 
that the evidence supports the use of these plans. He also suggests that while 
the results of studies of regrouping for reading and mathematics are 
inconclusive, it is still quite possible that grouping for one or two subjects is 
instructionally effective. 

Related to the literature on ability grouping is an increasingly large 
body of research on the relative effects of mainstreaming versus special 
class placement. Due in large part to the impact of the 1975 Education of All 
Handicapped Children Act (PL 94-142), which mandated that handicapped 
children had to be educated in the least restrictive environment, most of the 
recent research has focused on this special population, particularly the 
mildly handicapped, who prior to 1975 had most often been served in 
self-contained settings completely separate from the public schools. Be- 
cause the results of these studies are not directly generalizable to the types 
of students served by compensatory education programs, these mainstream- 
ing findings must be interpreted cautiously. However, it should also be 
recognized that the findings are not without merit and may be applicable to 
those now served by Chapter I and other state and local programs for the 
academically disadvantaged. In fact, as suggested by Leinhardt, Bickel, and 
Palley (1982), it may well be that some treatments are effective for both the 
mildly handicapped and the academically disadvantaged. 

Leinhardt and Palley's (1982) informative review of the literature on 
mainstreaming and restrictive educational settings concluded that while 
there is a trend toward negative effects of isolated settings for low-ability 
children, it is neither strong nor highly consistent. They also concluded that 
children on the borderline of the cutoff for special placement suffer more 
when they are isolated than do children who are closer to the median of the 
group of children that receive special services. Finally, they concluded that 
the setting itself (i.e., mainstreaming versus isolated setting) is not a 
powerful direct influence on student achievement, but that it indirectly 
influences other relevant variables, most of which can occur in most 
settings. 

Like Leinhardt and Palley (1982), Madden and Slavin (1983) also 
concluded from their review of the literature that full-time special education 
placements resulted in few consistent benefits for the mildly academically 
handicapped (i.e., learning disabled or educable mentally retarded). 
However, Madden and Slavin went further by concluding that regular class 
placement (i.e., mainstreaming) was preferred to special class placement on 
achievement grounds. The best evidence for this conclusion, Madden and 
Slavin argue, is the research of Calhoun and Elliott (1977) in which fifty 
black educable mentally retarded (EMR) students and fifty black emo- 
tionally disturbed students were randomly assigned to regular and special 
classes and followed over a three-year period. These researchers found that 
placement in regular classes had a more positive effect on the achievement 
of both groups than did special class placement. Interestingly, Madden and 
Slavin (1983) also cite a study by Leinhardt (1980) as support for regular 
class placement. In this study: 

Low achieving students in the regular class made significantly greater gains in 
reading achievement than did students in the special transition rooms, despite 
the fact that the transition rooms, which used the same curriculum, had much 
lower student teacher ratios and were staffed by some of the schools' "best" 
teachers (Madden and Slavin, 1983, p. 524). 

Madden and Slavin admit that their conclusions are based heavily on these 
two studies. However, they argue that their conclusion is warranted since 
the two studies are particularly well designed and their effects are quite 
large. 






Despite his general conclusion that regular class placement is preferred 
for the mildly academically handicapped, Slavin suggests that the "effects of 
regular class placement depend substantially on the program in the regular 
class" (1986, p. 530). Specifically, he argues that individualized instruction 
is required for the positive effects to occur, and that this applies to 
social-emotional outcomes as well as achievement. He also suggests, 
however, that part-time resource programs can have a positive impact on the 
achievement of the mildly academically handicapped if they are of good 
quality, again hinting that this implies that individualized instruction occurs. 
In this regard, Slavin seems to be at odds with Brophy (1986) and others who 
are calling for less individualized instruction and more direct or active 
instruction. 

Although Slavin is enthusiastic about regular class placement, he 
recognizes that the placement of mildly academically handicapped students 
in regular classes is fraught with problems. Included among these is the fact 
that regular class teachers feel poorly prepared to accommodate the needs of 
the mildly academically handicapped and that LD and EMR students in 
regular classes are less accepted and more rejected than their nonhandi- 
capped peers. Slavin describes a variety of programs for overcoming these 
problems, and proposes that these programs and strategies be used either to 
restructure the regular classroom or to construct resource programs to 
supplement instruction in the regular class. Thus, in the language of ability 
grouping, Slavin appears to be calling for heterogeneous groupings of 
students, including the mildly academically handicapped, within-class. He 
also suggests, however, that some form of regrouping for certain types of 
instruction, probably employing pullout since he uses the resource room 
terminology, may be required. 

But even if pullout programs can be justified on academic grounds, 
removing children from regular classrooms for compensatory instruction 
creates the possibility of resegregation. On this score, Glass and Smith 
(1977) have argued that pullout programs do not alter the racial mixture of 
schools but they may and probably do change the racial composition of the 
classes within them. Eyler (1982) is less equivocal. Citing the work of 
Hinkley, Beal, and Breglio (1978) and Brookover, Brady, and Warfield 
(1981), she argues that minority students receive larger amounts of compen- 
satory instruction than nonminorities and that they are typically pulled out 
of less segregated classrooms and sent to more segregated special rooms to 
receive their instruction. Citing the work of Kimbrough and Hill (1981), 
Eyler argues that the potential for resegregation through compensatory 
services is exacerbated in schools that operate more than one categorical 
program and have substantial numbers of students who are eligible for more 
than one type of service. She also argues that the negative effects of 
resegregation are not offset by pullout's effects on achievement. To avoid 
problems created by pullout, Eyler recommends that compensatory services 
be delivered in class by specialists who act as consultants rather than simply 
as subject matter specialists. In this redefined role they would serve as 
resources to the regular classroom teacher rather than instructors of specific 
groups of students. They would assist teachers in assessing specific learning 
problems and preparing individual learning plans, train classroom aides and 
parent volunteers, and help the principal plan the schoolwide instruction 
program. 

On the question of race relations, Slavin (1979, 1980) and Slavin and 
Madden (1979) reported that working together on school projects with a 
student of another race and competing together on interracial sports teams 
both had strong positive effects on racial attitudes and behaviors. However, 
teacher workshops, minority history, multiethnic texts, biracial student 
advisory committees, and similar school programs made no difference in 
racial attitudes or interracial friendships. Given these results, it would 
appear that if pullout programs do lead to resegregation, as Eyler (1982) and 
others have argued that they do, the resultant reduction in contact between 
minority and nonminority students will reduce the possibility of improved 
relations among the races. To the extent that this occurs, it must be viewed 
as a serious negative consequence of homogeneous grouping that uses the 
pullout model. 

TYPE OF INSTRUCTOR 

As noted earlier, instructional specialists provide the large majority of 
compensatory instruction. Generally, these specialists have a higher level of 
educational attainment than regular classroom teachers. In general, they 
also have less experience, and, because of this, specialists are likely to 
receive less pay. Despite the widespread use of these better trained but less 
experienced teachers in compensatory education programs, the literature 
tells us surprisingly little about the effect they have on students. Glass and 
Smith (1977) report on two studies that claim to show that remedial reading 
specialists have a positive effect on reading performance (Flynn, Hass, and 
Al-Salam, 1976; Kiesling, 1971). However, neither this review nor the one 
reported here was able to uncover any additional information on this issue. 
Perhaps because of this lack of information, Ligon and Doss (1982) have 
suggested that compensatory funds be used to purchase additional 
classroom teachers rather than reading specialists, thereby reducing the size 
of the regular classes. Allington (1986), on the other hand, believes there is 
ample support for the continued use of reading specialists. He does feel, 
however, that reading specialists will be under increasing pressure to justify 
their positions, particularly to their classroom teacher peers. 

Concerning the nature of their services, Eyler (1982) has suggested that 
the role of compensatory instructional specialists should be redefined to 
require them to work in in-class settings and serve as resources to the 
classroom teacher rather than as instructors of specific groups of students. 
Interestingly, Bean and Eichelberger (1985) have reported that reading 
specialists perceived that they placed less emphasis on both diagnosis of 
student weaknesses and teaching specific skills and more emphasis on 
working with content teachers when their school district changed from a 
pullout to an in-class approach. Classroom teachers concurred with spe- 
cialists and also reported that specialists spend more time working with 
them. Despite the similarity between what Eyler (1982) proposed and what 
actually occurred in the Bean and Eichelberger (1985) study, the latter 
authors reported that teachers and specialists both agreed that reading 
specialists should focus on working with students with reading problems. 
Neither group saw the reading specialist as a resource to the classroom 
teacher, although both groups valued having interaction about specific 
children. 

Shifting our attention to instructional aides, Schuetz (1980) reports that 
the evidence about their effectiveness is mixed. Qualitative studies have 
found that teachers using instructional aides spend more time on instructional 
activities, deliver more of their instruction in individual and small groups, 
feel they accomplish more, and feel that instructional aides are helpful. 
Quantitative studies of their impact on student achievement, however, are 
less conclusive. In fact, Schuetz reports positive, negative, and no effects 
attributable to instructional aides in her comprehensive review of the 
literature. Despite the lack of convincing evidence for their effectiveness, 
Schuetz provides useful information on the conditions that must be satisfied 
if aides are to be helpful: (1) teachers who use instructional aides must be 
genuinely receptive to them; (2) instructional aides must have the necessary 
literacy and computational skills to perform some useful instructional tasks 
in the classroom; (3) classroom management plans must be developed to 
take advantage of the presence of instructional aides; and (4) instructional 
aides must receive intensive training in the tasks they will be asked to 
perform. 

Viewed as a whole, it is clear that the literature concerning the impact 
on student performance of specialist teachers and instructional aides is 
inconclusive. This may not be surprising, since by this time in the chapter 
most readers may have grown accustomed to equivocal results. What may 
be surprising, however, is the lack of evidence about the effectiveness of 
instructional specialists and instructional aides regardless of the type of 
instructional program or activity in which they are involved. Perhaps 
Allington (1986) is right in arguing that reading specialists will be called on 
increasingly to justify their positions. Whether he is or not, research clearly 
needs to be performed to determine exactly what impact reading specialists 
have on student performance and whether the continued use of compen- 
satory education funds to purchase their services is warranted. 

SUMMARY AND CONCLUSIONS 



Since the passage of the Elementary and Secondary Education Act of 1965, 
billions of dollars have been spent each year to attempt to provide improved 
instruction for the poor and educationally disadvantaged. Despite the 
magnitude of this investment and the commitment of the many thousand 
people who have labored to ensure that these monies are well spent, the 
impact of compensatory education, although measurable and significant 
(Carter, 1984; Forbes, 1985; Stickney and Plunkett, 1982; Wang, 1980), has 
nonetheless been viewed as disappointing by some (e.g., Hallinan, 1986) and 
less than hoped for by others (e.g., Cooley, 1981). Cooley (1981), however, 
has also argued that the search for overall effects of compensatory educa- 
tion is an inappropriate goal. He suggests that researchers and evaluators 
should stop trying to justify this federal funding program and focus instead 
on ways in which compensatory education can be improved. With this goal 
in mind, a review of the literature was undertaken to determine what is 
known about the effectiveness of certain design features of programs for the 
educationally disadvantaged. Specifically, the review was concerned with 
the effects of particular instructional settings (i.e., whether the instruction 
was delivered in class or outside the regular classroom in a pullout setting), 
class size, instructional grouping practices, individualized instruction, and 
instructional specialists and aides. 

Although recent research indicates that the in-class approach is being 
used more frequently (Advanced Technology, 1983), the literature clearly 
shows that most compensatory instruction is delivered in a pullout setting. It 
also shows that compensatory instruction is predominantly reading and 
mathematics instruction, and that it is provided primarily by instructional 
specialists and, to a lesser extent, instructional aides who work with smaller 
groups of students and provide more individualized instruction than regular 
classroom teachers. 

The research describing the impact of pullout programs on the achieve- 
ment of compensatory education students is less clear. In fact, based on 
what we know at this time, one would be unable to provide convincing 
evidence for the effectiveness of either the pullout or in-class approach. 
Given these mixed results, however, one could and should argue that the 
widespread use of pullout programs is not justified. Moreover, since there is 
some evidence to suggest that pullout may result in resegregation, there is 
further reason to question its wholesale adoption. But that does not mean 
that pullout programs should be abandoned. Rather, it means that the 
programs implemented should fit local conditions. 

Critics of pullout have suggested that it unnecessarily labels and 
stigmatizes students, that it reduces communication, cooperation, and 
coordination between regular and compensatory instructors, that it reduces 
teacher responsibility for pupil progress, and that valuable content is lost 
when pullout instruction is received. Despite the importance of these 
criticisms, counterarguments also appear to have some value. Labeling and 
stigmatization can occur in in-class settings as well as in pullout settings. 
Moreover, it may be that it is not the setting itself that aggravates a 
compensatory education student's feelings of being different, but the 
sensitivity or insensitivity of teachers operating within the setting (Nod- 
dings, 1978). It also appears that even though an in-class program makes it 
easier and more likely for communication, cooperation, and coordination to 
occur, it does not guarantee that they will. There is also some evidence to 
suggest that certain types of teachers are just not able to work together in a 
single classroom and that joint planning may take place in much the same 
way in in-class programs as in pullout programs, before school, during 
recess, during lunch, and after school. Because some teachers may not be 
able to accommodate another teacher in the same classroom, the issue of 
teacher autonomy and responsibility for student progress is also not 
automatically resolved by in-class programs. Perhaps, as suggested by 
Kennedy (1978), the key to good coordination between compensatory and 
regular instruction is not the setting for instruction but rather the quality of 
the joint planning that occurs, the amount of decision-making power given to 
teachers and specialists by their administrators, and the flexibility in 
scheduling allowed by the environment. 

With regard to the content lost issue, it is clear that some instruction is 
sacrificed when pullout compensatory instruction is received. It is also clear 
that valuable instructional time is lost in the process of moving to and from 
the pullout environment. It is not clear, however, what types of instruction, 
if any, would be lost if compensatory instruction were delivered in class. It is 
also not clear whether the interruptions resulting from students being pulled 
out for instruction are less harmful than the confusion resulting from two 
teachers working in the same classroom. More research needs to be done on 
these issues. 

Those who favor pullout programs herald the virtues of specialist 
teachers and aides, smaller class sizes, increased opportunities for individu- 
alization, and more effective grouping practices. What do we know about 
their effectiveness? Surprisingly, we know very little about the impact of 
instructional specialists and aides on student performance. In fact, this 
author has been able to uncover only two studies that claim to show that 
reading specialists have a positive effect on achievement (Flynn, Hass, and 
Al-Salam, 1976; Kiesling, 1971). Allington (1986) believes there is ample 
support for the continued use of reading specialists, but his arguments are 
not particularly convincing. Moreover, he cautions reading specialists that 
in the future they will be under increasing pressure to justify their positions, 
particularly to their classroom teacher peers. Regarding classroom aides, 
Schuetz (1980) reports that quantitative studies lead to no clear conclusions 
about their effectiveness. She does suggest, however, that some teachers 
can use them effectively, but that in order for this to occur, teachers must be 
genuinely receptive to them, the aides must have the necessary literacy and 
computational skills to perform useful instructional tasks, they must receive 
intensive training in the tasks they will perform, and effective classroom 
management plans must be developed to accommodate them. 

Despite the paucity of hard, convincing evidence about the effective- 
ness of instructional specialists and aides, they continue to be used as the 
primary agents of compensatory instruction. In the early days of Title I, 
their use was clearly justified given the legal requirements surrounding the 
allocation of these federal monies. In fact, it has been argued (Archambault, 
1986) that these requirements were precisely the reason that pullout pro- 
grams, with the accompanying specialists and aides, became so popular. At 
present, however, there are fewer constraints. And yet, even the in-class 
models that are being implemented use instructional specialists or aides as 
the service providers. We need more research on their effectiveness; we 
need more creative approaches to their use. 

Turning now to the class size issue, with the exception of the work of 
Slavin (1984), there seems to be general agreement that class sizes of less 
than fifteen optimize learning outcomes. Moreover, and as argued by 
Cooper (1986), the effects are most dramatic in classes containing from one 
to five students, the size of the group in which the majority of compensatory 
education students receive their instruction (Archambault and St. Pierre, 
1979). Thus it would appear that smaller class sizes are to be favored. But 
what of the composition of the groups within these classes? Is heteroge- 
neous or homogeneous grouping preferred? 

Slavin's (1986) comprehensive review of the effects of instructional 
grouping strategies offers the best organization of the literature on this 
subject as well as some of the best insights into what that literature says. 
Slavin distinguishes within-class and between-class ability grouping, and he 
considers the effects of each separately. He concludes that homogeneous 
within-class ability grouping leads to improved student performance in 
mathematics and that this effect holds for students of all ability levels. This 
conflicts with the conclusions of Eash (1961), Persell (1977), Wilkinson 
(1986), and Hallinan (1986) that homogeneous ability grouping is detrimental 
to students assigned to low groups. One explanation for this difference is 
that Slavin focused on mathematics, whereas the other reviewers, particu- 
larly Wilkinson (1986), focused more on reading. More important than this, 
however, is the fact that much of the research cited by Wilkinson deals with 
observations that students in low reading or math groups learn less than 
students in higher groups, not less than other low-achieving students in 
contrasting arrangements. Like Reichardt (1979), Slavin argues that when 
groups are not equivalent preexperimentally (i.e., when there is no random 
assignment to groups) such comparisons will tend to favor higher achieving 
groups. 

Taken together, then, the research suggests that for mathematics 
within-class ability grouping should be supported for all students, including 
low achievers. However, it does not appear reasonable to conclude at this 
time that either homogeneous or heterogeneous within-class ability grouping 
is preferred for delivering reading instruction to low-ability students. As 
suggested by Slavin, such a conclusion can be reached only when experi- 
mental comparisons of grouped and ungrouped reading classes have been 
performed. 

Turning now to between-class ability grouping, a wealth of advice is 
available in the literature. Some reviewers have concluded that no general- 
izations are possible about the effects of ability grouping (Glass and Smith, 
1977), others have concluded that while the performance of students in 
grouped classes is better than the performance of students in ungrouped 
classes there appears to be no systematic effect for low-ability students 
(Kulik and Kulik, 1982), and still others have argued that ability grouping is 
detrimental for low-ability students (Begle, 1975; Esposito, 1973). Slavin 
(1986), who looked at the effects of seven types of between-class ability 
grouping strategies, concluded that research does not support the assign- 
ment of students to self-contained classes according to ability, nor does it 
suggest that ability grouping is harmful for low-ability students. It does 
suggest, however, that cross-grade tutoring and comprehensive nongraded 
plans can be effective. It also suggests to Slavin that although the results of 
regrouping for reading and mathematics are inconclusive, it is still possible 
that grouping for one or two subjects is instructionally effective. 

The last of the practices reviewed in this chapter was individualization. 
This would appear to be a particular benefit for compensatory education 
students since the reduced class size in which they receive their compen- 
satory instruction should result in more individual attention. However, 
Cahen and colleagues (1983) and Harnischfeger (1980) have both concluded 
that smaller class size does not necessarily lead to increased individualiza- 
tion. Cooley and Leinhardt (1980) have also concluded that there is not any 
clear evidence for the superiority of individualized instruction over other 
instructional strategies. Brophy (1986) argues that the problem is not with 
the conceptual model of individualization, which calls for beginning where 
students are and moving them along at their own pace. Rather, the problem 
is that as implemented individualized instruction shifts the responsibility for 
teaching to the student. This does not work when used over extended 
periods of time. It can be used effectively, however, in classrooms with 
small student/teacher ratios, where the teacher is able to provide close 
supervision and where other types of instructional methods are used in 
conjunction with individualized instruction, particularly what Brophy calls 
"active instruction." 

What do these results suggest for practice? What advice can we offer to 
those charged with the implementation of compensatory education pro- 
grams? Under the best of conditions, the translation of research into practice 
is a risky business. When the evidence is not clear-cut the perils are even 
greater. Researchers need to recognize this. They also must recognize that 
practitioners want and need help and that the practice of schooling cannot 
wait for unambiguous research results. With this in mind, and with no little 
amount of trepidation, the following advice is given: 

1. Setting: Based on what we know at this time, it is safe to conclude that 
setting is not directly responsible for student outcomes and that the issue of 
effective practices within settings, and not the setting itself, should be the 
focus of our attention. This implies that educators can more confidently 
select a setting based on local conditions, perhaps using pullout approaches 
in those areas where resegregation would not occur, where teachers are 
committed to the use of these practices, where coordination and cooperation 
are possible, and so forth. But, where these and certain other conditions are 
not met, and perhaps even within the same school district, in-class programs 
should be used. 

2. Specialists and Aides: It is clear that classroom teachers need help in 
meeting the needs of the academically disadvantaged. It is less clear whether 
that help should be provided by instructional specialists, aides, or some 
other type of teacher. Since there is so little evidence telling us what 
combinations of instructors and conditions are likely to maximize student 
performance, the best advice is that school districts should again feel 
confident in using whatever combination appears to meet local needs. In 
certain situations it may be best to have specialists work with students in a 
pullout setting; in others it may be more effective to introduce a specialist or 
aide into the regular classroom. If the latter option is chosen, it may be better 
in some situations for the specialist to work with compensatory education 
students; in others the skills of the teacher and specialist may dictate that the 
specialist should work with other students or groups while the classroom 
teacher concentrates on the needs of the compensatory education student. 
In determining which option is best, school administrators should honestly 
assess the competencies of their staff and attempt to match them to the 
needs of students. They should also consider the needs of teachers and their 
preferences for and their ability to adapt to the various options. Finally, and 
no less importantly, they should determine which of the options is likely to 
deliver the highest-quality instruction to the student. Key issues in this 
determination should be which program will ensure the greatest amount of 
engaged time in specific subject areas, the best communication, cooperation, 
and coordination among the regular and compensatory program, the greatest 
integration of instruction across these two domains, and the fewest negative 
consequences such as the stigma attached to receiving compensatory 
instruction. 

3. Class Size and Grouping: Smaller class sizes are preferred, particularly 
when they include fifteen or fewer students. Educationally disadvantaged 
students benefit from such arrangements, as do other students in the class. 
Educationally disadvantaged students also benefit from classes that are 
heterogeneous in ability, and they do so at no expense to the higher-ability 
students. This does not deny the advantages of within-class ability grouping 
for certain subjects, particularly reading and mathematics, nor the possi- 
bility that cross-grade grouping for certain content would also be beneficial. 
In fact, both of these practices are to be encouraged. 

4. Individualization: Individualized instruction can be harmful when used 
as the sole method of instruction. That is not to say that teachers should not 
determine the individual needs of students and help them to progress at their 
own pace; it does mean, however, that teachers should not transfer the 
responsibility for this progress to students laboring on their own for 
extended periods of time on worksheets and other published materials. Said 
in a different way, teachers must take a much more active role in instruction 
than the practice of individualization has come to imply. Individualization 
can be used effectively, however, in classrooms with small student/teacher 
ratios where the teacher is able to provide close supervision. 

From a number of quarters and for a number of reasons, effective 
educational practices are now being emphasized in the literature. Edmonds 
(1979) and Brookover, Schweitzer, Schneider, Beady, Flood, and Wisenbaker 
(1978) talk about effective schools, that is, schools in which (1) there 
is a commitment to particular instructional models and programs, (2) there is 
agreement on goals and objectives, (3) principals assume a leadership role in 
the determination of what instruction is to be offered and how it is delivered, 
(4) administrators and teachers have high expectations for students and 
communicate these expectations to them, (5) there is frequent monitoring 
and feedback of student performance, and (6) there is effective teacher 
training. Leinhardt and Pallay (1982), looking more closely at classrooms, 
call for smaller classes, mastery learning (i.e., clear goal statements and 
regular monitoring of student progress), more time spent in cognitive 
activities, reasonably rapid pacing of instruction, use of a formal management 
system that encourages task orientation and successful completion of 
tasks, and increased teacher instructional time. Brophy (1986), echoing 
some of Leinhardt and Pallay's comments, would use this increased time for 
instruction for what he calls active teaching. This connotes frequent 
presentations by the teacher in which she or he develops concepts through 
lecture and demonstration, follows those up with recitations by students or 
dialog with them, prepares the students for assignments by giving instructions 
and going through practice examples, monitors progress on assignments, 
and follows up with appropriate feedback and reteaching when 
necessary. There is a great deal of teacher talk, but most of it is academic 
rather than procedural or managerial, and most of it involves asking 
questions and giving feedback rather than extended lecturing. 

These and other researchers have taught us a great deal about what 
makes schools and the people within them effective. Nonetheless, we have 
much more to learn. 

INSTRUCTIONAL ACTIVITIES 
RELATED TO 
ACHIEVEMENT GAIN IN 
CHAPTER I CLASSES 

John Crawford 

Planning, Research, and Evaluation Department 
Oklahoma City Public Schools 



On a Monday morning in March at 8:42, nine students come in for their 
second-grade reading lesson. They are met by the Learning Center teacher 
and aide. The setting is a school serving a relatively lower socioeconomic 
status constituency—more than one-half of the students in the school 
qualify for free or reduced-payment lunches. The students in the class have 
qualified for compensatory education with scores below the 40th percentile 
nationally in reading. The resources for the pullout program teacher, the 
small class size, the extra materials, and the paraprofessional aide come 
from the federally funded Title I program (now Chapter I). 

What the observer (also the author of this chapter) did not know at the 
time was that this teacher would have students with the highest achievement 
gains of any of the seventy-nine teachers in the study. 

The fact that all the students arrived at the same time (and on time) 
suggests some coordination with the regular classrooms and/or a system 
whereby student transition time is monitored and minimized. 

Within seconds of the students' arrival, the teacher has greeted 
everyone and has them all on the rug for whole-group activity. For about 
eight minutes the teacher reads a story to the students and asks comprehen- 
sion questions of the whole group. Students respond chorally most of the 
time. The narrative comments for those minutes noted: 

At 8:42, kids come in and sit together to hear Teacher read story on Silly Sam 
and the Singing Fish. 
Kids very attentive. 

At 8:50, end of story. Kids break out to work individually. 

From 8:50 to 9:20 the students worked individually on worksheet-type 
activities. However, in addition to pencil and paper, the activities for six of 
the nine students involved some kind of audiovisual equipment (audio 
cassette, video reels with sound, etc.). During this time of individual work, 
the teacher and the aide are both quite busy. While students are engaged 
with their individual tasks, the teacher and aide are circulating to the 
students. Although all students are contacted, more time is allocated to the 
students with the greater need for help. 

When procedure-related interactions occur (such as helping a student 
with some of the audiovisual equipment), they are most likely to be brief. 
However, when the teacher initiates an academic contact with a student, it 
is more likely to be coded as "long" or "very long" (in the coding 
scheme, "long" was more than a brief phrase or interaction but less than 
thirty seconds; "very long" was defined as more than thirty seconds with 
the same student). 

Narrative notes indicated that: 

From 9:13 on, Teacher with one boy for very long interaction (approximately 2 
to 3 minutes). 

As the teacher was working with a student individually, the aide was 
moving around the room to monitor and help other students. 

At 9:20, the teacher told the students to begin getting ready to go, and 
by 9:22 the students were off to their regular classrooms. 

More than 90 percent of the available time had been devoted to the 
academic content. During the forty-minute observation, only two student 
misbehaviors were coded. Both were "mild" in intensity (not distracting to 
the whole class), and were efficiently dealt with by a "mild desist" from the 
teacher. The teacher did not make a disciplinary error (behavior manage- 
ment codes were based on the work of Kounin, 1970). 

This teacher placed a high priority on mastery of academic content. 
Good use was made of the 4.5 to 1 student-to-adult ratio by individual 
instruction (private one-to-one contacts) for the majority of the lesson. 
About one-fifth of the time was spent in a nonthreatening whole-group 
exercise. There was a pleasant affective climate and the students were 
actively engaged with their reading lessons. 

Over the course of the school year, the average gain of this teacher's 
students was 13 NCE (Normal Curve Equivalent) points on total reading. 
NCE scores were developed as an equal interval scale for measuring 
achievement; NCEs agree with percentiles at 1, 50, and 99. Depending on 
where one starts on the percentile scale, a 13-point NCE gain could result in 
a 15- to 20-point percentile gain (e.g., a student increasing from the 10th to 
the 26th percentile, or a student going from the 20th to the 40th percentile). 
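
The relation between NCE gains and percentile gains can be illustrated with a short sketch. It assumes the conventional definition of the NCE scale (a normal distribution with mean 50 and standard deviation 21.06, which is what makes NCEs and percentiles coincide at 1, 50, and 99); the function names are illustrative and do not come from the study. Because it relies on an idealized normal curve, its results are close to, but not identical with, the rounded examples given above.

```python
from statistics import NormalDist

# Assumption: the standard NCE definition (mean 50, SD 21.06 on a normal curve).
_STD_NORMAL = NormalDist()
NCE_MEAN = 50.0
NCE_SD = 21.06


def percentile_to_nce(percentile):
    """Convert a national percentile rank (between 0 and 100) to an NCE score."""
    z = _STD_NORMAL.inv_cdf(percentile / 100.0)
    return NCE_MEAN + NCE_SD * z


def nce_to_percentile(nce):
    """Convert an NCE score back to a national percentile rank."""
    z = (nce - NCE_MEAN) / NCE_SD
    return 100.0 * _STD_NORMAL.cdf(z)


def percentile_after_gain(start_percentile, nce_gain):
    """Percentile rank reached after adding an NCE gain to a starting percentile."""
    return nce_to_percentile(percentile_to_nce(start_percentile) + nce_gain)


# The same 13-point NCE gain moves students by different numbers of percentile points.
for start in (10, 20):
    end = percentile_after_gain(start, 13)
    print(f"start at percentile {start}: ends near percentile {end:.0f}")
```

The equal-interval property is the point of the scale: the gain is a constant 13 NCE points, while the corresponding percentile change depends on the starting position.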

The above description resulted from low inference coding, time-use 
description, and narrative records of a particular teacher's class, an 
elementary lesson in reading. It is of some interest that the teacher with the 
highest-gaining students displayed integrated coordination of the same 
complex of behaviors that were found in the full sample to be correlated with 
greater achievement growth. (See Shulman, 1986, for speculation that 
process-product research has yielded little evidence that any individual 
teachers behave in ways that the research indicates are effective.) 

This chapter covers currently available information on what works in 
Chapter I compensatory programs. Characteristics of programs labeled 
"exemplary" will be compiled with research findings into a picture of 
effective instructional principles for Chapter I classes. 



IDENTIFICATION OF EFFECTIVE 
CHAPTER I PROGRAMS 

The US Department of Education (DOE) recently undertook an effort to 
identify successful compensatory programs across the nation. The rationale 
and results are reported in a two-volume sourcebook prepared by Griswold, 
Cotton, and Hansen (1985). Volume I presents a review of school and 
teacher effects research and synthesizes the findings into thirteen attributes 
of effective programs. Volume II presents brief profiles of each of the 
projects selected as successful. 

The identification of compensatory projects worthy of national recognition 
was modeled after the earlier DOE Secondary School Recognition 
Program. Nominations were requested from state educational agencies, and 
following a review panel consideration of program documentation, the final 
selection of effective projects was completed. The state agencies nominated 
334 projects nationwide. Of that number, 118 were selected as most effective 
and ready for dissemination. Two of the 118 withdrew, so 116 projects are 
described in the sourcebook. 

Projects were selected on the basis of evidence of effectiveness in one 
or more of three areas of performance: achievement (relatively short-term 
gains or point-in-time measures), sustained gains (at least three measures of 
achievement over at least one year), and other outcomes such as attendance, 
attitude, improvement in discipline, and so on. Almost all (114 of 116) of the 
cited projects presented evidence of effectiveness in achievement in basic 
skills. A total of 45 projects reported sustained gains. And 31 of the identified 
programs documented other (affective) outcomes associated with their 
projects. 

The schooling attributes related to the previously described outcomes 
were termed organizational and instructional characteristics of effective 
programs. The organizational attributes included school and parent/community 
influences; instructional attributes referred to materials, 
grouping methods, use of time, interactive teaching, high expectations, 
feedback, and reinforcement. Each of the organizational and instructional 
attributes is described briefly below. 



ORGANIZATIONAL ATTRIBUTES OF 
EFFECTIVE PROGRAMS 

Positive Climate 

A school with orderly classrooms, high expectations for student achievement 
and behavior, a pleasant physical environment, and well-established 
rules is one with a strong positive climate. School effects research (e.g., 
Brookover, Schweitzer, Schneider, Beady, Flood, and Wisenbaker, 1978; 
Crawford and Watson, 1985) has consistently shown that high-achieving 
schools also tend to have a positive climate for learning. For example, 
successful Chapter I programs identified in the sourcebook discussed the 
process of gaining visibility and a positive image for the program within the 
school, involvement in a districtwide climate committee, and maintenance 
of a resource room that is pleasant, well-lighted, and colorful. 

Clear Goals and Objectives for the School 

The school's goals and objectives should be articulated clearly and should 
arise from needs identified by parents, staff, and the community. Once set, 
the goals and objectives are communicated to interested parties. One of the 
Chapter I programs identified as exemplary used a systematic needs 
assessment where input was sought from civic clubs, the local Chamber of 
Commerce, parent groups, and school staff. Based on the survey results, 
program staff set specific objectives for the Chapter I project. 

Coordination 

Many of the successful Chapter I projects cited in the sourcebook included 
as part of their documentation a discussion of how they coordinated their 
program with the regular classroom instruction or with other special 
programs. The principal plays a key role in fostering cooperation among 
programs. At effective schools, principals and other leaders in the school 
encourage planning groups and provide opportunity for them to work 
together. Coordination efforts work to align curricula and objectives across 
programs. Examples from successful Chapter I programs included use of 
daily log sheets that record Chapter I and regular teacher communication 
(reviewed by coordinators and principals), staff meetings where objectives 
and instructional activities of the Chapter I program and the regular 
classroom are examined and made to match, and use of information on how 
well instruction is coordinated with the regular classroom in evaluating 
Chapter I teachers. 



Parent/Community Involvement 

Although general support by parents and community (e.g., PTAs, Adopt-a- 
School programs) may be only weakly related to program and school 
effectiveness, the direct involvement of parents in their children's learning 
can be a potent facilitator of student achievement. According to Volume I of 
the sourcebook, "The most effective parent involvement emphasizes providing 
parents with training to extend the classroom into the home" (p. 31). 
For example, one Chapter I program sent a traveling resource center to 
schools to allow parents to check out learning materials for use with their 
children at home. One such course suggested forty ways that parents could 
work with their children. Parent support was also given by inservice 
training. 



Staff Development and Training 

Professional development activities form a portion of the profile of the 
effective school. Effective principals encourage staff development with 
opportunity and incentive. One effective Chapter I program involved its 
staff in the development of inservice activities, the focusing of the content of 
the inservice on research-based effective practices, and the evaluation of the 
inservice in order to plan subsequent training sessions. 



Use of Evaluation Results for Program Improvement 

Although the norm for Chapter I projects may be little use of evaluation 
results (see David, 1981), many effective programs did indicate that evaluation 
findings were used to improve the program. Commitment to the use of 
evaluation as an integral piece of school improvement was an effective 
attribute. One successful Chapter I program (in the Oklahoma City Public 
Schools, also the source of the process-product relations to be reported 
later in this chapter) used evaluation results to change student eligibility 
criteria and to derive content for inservice packets based on local teaching 
effectiveness studies in Chapter I classes. In another program, evaluation 
results led to the development of a management system to improve 
coordination between the regular classroom and Chapter I. 

Strong Leadership 



One of the more consistent findings from school effects research indicates 
that effective schools have strong instructional leaders. Instructional leadership 
(typically by the principal) creates a climate within which student 
learning is a top priority. Such leadership engenders a focus on school or 
program goals and motivates teaching staff. Examples of successful pro- 
grams documented close coordination between a strong principal and 
project director. Also, the project director may assume the leadership role in 
fostering a focus on goals. 



Instructional attributes are those characteristics presumed to occur within 
classes (as opposed to the school-level organizational attributes just discussed). 
These factors are more under the control of teachers than the 
organizational attributes. 



The broadly conceived category of appropriate instructional methods and 
materials actually includes several specific attributes: "interactive" 
teaching, ability grouping and individualized instruction, 
mastery learning, adaptive learning environments, cognitive education, 
and cross-age tutoring. 

Interactive teaching, as referred to by the sourcebook, is a combination 
of direct instruction and active teaching (Brophy, 1979; Good and 
Grouws, 1979; Rosenshine, 1983). Teachers skilled in this method of 
teaching follow a sequence of presenting concepts, explaining, providing 
practice activities, and actively monitoring student learning. They frequently 
ask questions and provide feedback to students. 

Ability grouping in the lower grades is common practice. Some 
research in regular classes suggests that the lower-ability groups may have 
lower expectations communicated to them by the kinds of activities in which 
teachers engage them. For example, the lower group may spend more time 
with flash cards and less time reading stories, and teachers may emphasize 
pronunciation rather than meaning when instructing these students. When 
using ability grouping, these kinds of differentiations should be avoided. 

Commonly held opinion is that pullout programs have an undesirable 
stigma attached and that negative effects on students may occur as a 
consequence. However, research has not shown pullout programs to be 
consistently inferior for achievement growth. And some in-class programs 
amount to pullout within the regular class (e.g., the resource teacher 
gathers the low achievers together in the back of the room). The author's 
personal experience with Chapter I students and teachers suggests that in 
the early grades there may be little or no negative stigma attached to going to 
the Learning Center, although upper-grade students are surely more aware 
of the reason that they go and are more concerned about peer attitude. 

Instructional programs such as mastery learning and individualized 
instruction are also discussed in the DOE sourcebook. Mastery learning 
strategies, as discussed by Bloom (1976), are based on an instructional 
sequence in which student success on lower-level objectives is required 
prior to moving on to more advanced objectives. Students are allowed to 
vary on the amount of time (e.g., the number of trials or attempts) required 
for mastery; however, the theory is that almost all (95 percent) of the 
students are able to master the basic skills. Although the term individualized 
instruction is used many different ways, the sourcebook uses the term to 
refer to implementation of mastery learning by individual pacing, individual 
diagnosis and prescription, and use of different materials for different 
students. In Chapter I classes where small numbers of students may be 
present, it could be feasible to individualize effectively, though such 
individualization should not take the form of large amounts of unmonitored 
seatwork (which would supplant active teaching). 

The sourcebook briefly mentions "adaptive learning environments" 
and "cognitive education." By adaptive learning environments the authors mean 
an instructional strategy that combines sequenced mastery learning with 
individualized instruction. Cognitive education refers to the recent work on 
teaching students how to learn: strategies for organizing, retrieving, and 
synthesizing information are covered. 

The last specific method discussed was cross-age tutoring, where older 
students (e.g., in grades 4-6) instruct younger students (such as grades 1-3). 
Research has supported the effectiveness of such tutoring, especially with 
students who are initially lower achieving. The sourcebook also cites 
evidence that cross-age tutoring is more effective in math than in reading. 

Examples of use of appropriate instructional methods and materials in 
exemplary Chapter I projects include use of frequent (mastery) unit tests, 
focus on one or two skills at a time, tutoring, direct instruction models (with 
maximizing of teacher-student interactions), and use of computer-assisted 
instruction for individual pacing and increased student motivation. 

Maximum Use of Academic Learning Time 

The instructional attributes related to use of in-class time were also 
discussed. The Fisher, Filby, Marliave, Cahen, Dishaw, Moore, and 
Berliner (1978) Beginning Teacher Evaluation Study (BTES) and results 
regarding "academic learning time" constituted the primary research evidence. 
Effective teachers maximized allocated academic time and the 
amount of time students were engaged with the curriculum content. Also, 
BTES results suggested that the level of difficulty of in-class work was 
geared so that students experienced success. Other factors relating to use of 
available time were management strategies (the better managers had more 
time available for content) and assignment of homework (most effective when 
accompanied by written comments from the teacher and when feedback on 
homework is not delayed). 
Exemplary Chapter I programs made effective use of time by careful 
scheduling of students, monitoring student success rate, training of students 
on how to remain "on task," and parent-monitored homework. 

High Expectations 

Following the Rosenthal and Jacobson (1968) study, much attention in the 
research community has been focused on the phenomenon of teacher 
expectations. The results have indicated that expectations and belief sys- 
tems of teachers are often reflected in teaching behavior, and that behavior 
impacts how students respond and perform. Expectations are made visible 
in the kinds of feedback given to students, the amount of wait time 
(following a question from the teacher), and the within-class climate 
regarding learning and discipline. Exemplary Chapter I projects focused on 
specific and appropriate feedback and included encouragement for student 
effort and attitude (in addition to achievement). 



Monitoring 

The use of tests, in-class performance assessments, and analysis of verbal 
responses from students to monitor student progress is positively related to 
learning. One successful Chapter I project diagnosed student needs with 
frequent criterion-referenced testing. 

Feedback and Reinforcement 

Verbal praise has been found to be an effective reinforcer if it is genuine, not 
too frequent, specific, and truly contingent on superior performance. The 
sourcebook review notes that the impact of teacher praise probably diminishes 
as the grade of the student increases beyond elementary. The authors 
indicate that for upper grade and relatively higher SES (socioeconomic 
status) students, cognitive feedback is an effective strategy; this kind of 
feedback conveys information about the student's response. Effective 
Chapter I programs have trained teachers to give very specific praise and to 
avoid reprimands that do not correct the student positively. 

Recognition and Reward for Excellence 



The teachers who recognize and reward excellent performance from 
students can be effective users of reinforcement. The rewards may be 
symbolic or more concrete, such as a small amount of free reading time at the 
end of the period. Effective Chapter I programs use a variety of rewards for 
superior performance: awards, certificates, phone calls or letters to parents, 
display of student work, special awards from the principal, and ceremonies 
for students who test out of the program. 



Of the 116 projects, 46 were from urban school districts, 27 from suburban 
districts, 27 from small towns, and 25 from rural areas. (Some projects were 
classified in more than one category.) The identified effective programs 
varied in size: 17 served fewer than 100 students, 49 served 100 to 499 
students, 41 served 500 to 2,500 students, and 9 served more than 2,500 
students. The majority of the projects were focused on elementary-grade 
students: 91 of the 116 served kindergarten through grade 3 and 86 projects 
had students in grades 4 through 6. A total of 35 projects served students in 
grades 7 through 9, 22 were in grades 10 through 12, and 12 were serving 
preschool students. Some 94 projects delivered services in reading, 5S in 
math, 29 in language arts, and 23 in other curriculum areas (63 of the 116 
successful programs served students in more than one curriculum area). 

Each project was identified by three primary attributes of success, 
categorized either as organizational or instructional factors. In rank order of 
frequency of citation: 



SUMMARY OF EFFECTIVE 
PROJECT PROFILES 

Instructional attribute              Number of projects 

Appropriate instruction                      66 
Monitoring of progress                       26 
Maximized academic learning time             23 
High expectations                            20 
Feedback/reinforcement                       14 
Recognition of excellence                     7 

Organizational attribute             Number of projects 

Coordination                                 42 
Parent/community involvement                 41 
Professional development                     27 
Clear goals/objectives                       25 
Strong leadership                            20 
Use of evaluation                            19 
Positive climate                             18 



THE INSTRUCTIONAL DIMENSIONS STUDY 

Cooley and Leinhardt (1980) reported a major investigation of regular 
classroom instruction of educationally disadvantaged students. Their study 
was designed to identify variation in classroom activities in reading and math 
(grades 1 and 3) and to relate that variation to learning outcomes. Observa- 
tions were made of the students while they were in the regular classroom. 

The Instructional Dimensions Study (IDS) was framed by a model of 
classroom processes. The IDS model addresses six classes of variables: 
initial student performance, opportunity, motivators, instructional events, 
structure, and criterion performance. Each of these will be described briefly 
below. 

Initial student performance represents skill level or ability of students 
prior to the educational intervention. In the model, initial performance is 
hypothesized as a direct influence on the outcome or criterion. 

The four constructs representing classroom processes are also pre- 
sumed to impact criterion performance directly. Opportunity refers to the 
students' opportunity to learn the content assessed as criterion. This 
construct derives from John Carroll's (1963) model of school learning, which 
included opportunity to learn. In the IDS, opportunity was measured by how 
time was spent in classrooms and by the degree of match between the 
curriculum content and the tests. Motivators were assessed by measures 
relating to interpersonal climate and interactions in the classroom. Indica- 
tors included variables such as interest in curriculum material, use of peer 
tutoring, games and contests, and teacher praise. The instructional events 
measures addressed frequency, content, quality, and duration of teacher-student 
interactions. For example, management variables, individual/small-group/whole-class 
measures, and questioning and feedback variables were 
all recorded. What Cooley and Leinhardt refer to as structure is characterized 
by level of organization of curriculum, specificity of objectives, and the 
match of student and curriculum. Measures of class structure included how 
students were placed in the curriculum, use of mastery tests, formation of 
groups, attention to individual needs, and sequencing. 

The criterion performance measure was a standardized achievement 
test, the Comprehensive Test of Basic Skills. Classrooms were selected for 
the IDS through a multistage sampling process intended to increase the 
likelihood of identifying variation in classroom processes. Data collection 
was carried out in 400 classrooms, in 100 different schools, from fourteen 
school districts in five different states. The classroom was the unit of 
analysis: process data and student test scores were aggregated to the class 
level. Therefore, process-outcome relations reflect among-class covariation. 
Process data were collected by videotaping classrooms, with 
subsequent coding of observation variables from the taped records (approximately 
four hours per teacher). Teachers were also interviewed and curriculum 
experts generated data on curriculum indicators. 

In order to yield a more manageable number of classroom variables, 
process measures were combined under the framework of the IDS model so 
that fifteen constructs were represented in the analyses. A total of 200 
classes entered the grade 1 analysis and 200 entered the grade 3 analysis. 
Cooley and Leinhardt indicated a preference for a "conservative" approach 
over one that would explicitly argue causality. Therefore, they abstained from 
analysis of formal causal models and proceeded to commonality analysis. 
They argued that commonality analysis: 

presents separately the confounded portions of effects on outcomes 
(called commonalities). These confounded portions are not included in estimating 
the irreducible effects (called uniquenesses) of the separate possible 
causes. The conservativeness of the method resides in the use of minimum 
rather than maximum estimates of separate effects (p. 16). 
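
The logic of the quoted passage can be sketched for the simplest case of two sets of predictors. The sketch below uses the standard two-set commonality decomposition, in which each set's uniqueness is the R-squared it adds beyond the other set and the commonality is the portion that cannot be assigned to either set alone. It is an illustration under stated assumptions, not a reproduction of the IDS analysis: the function name is hypothetical, the input values are invented for the example, and the IDS itself worked with more than two constructs.

```python
def commonality_two_sets(r2_a, r2_b, r2_ab):
    """Decompose R-squared for two predictor sets, A and B.

    r2_a  : R-squared with set A alone as predictors
    r2_b  : R-squared with set B alone as predictors
    r2_ab : R-squared with both sets in the model
    """
    unique_a = r2_ab - r2_b        # what A explains beyond B (uniqueness of A)
    unique_b = r2_ab - r2_a        # what B explains beyond A (uniqueness of B)
    common = r2_a + r2_b - r2_ab   # confounded portion shared by A and B
    return {"unique_a": unique_a, "unique_b": unique_b, "common": common}


# Hypothetical values chosen only for illustration; they are not IDS results.
parts = commonality_two_sets(r2_a=0.30, r2_b=0.20, r2_ab=0.40)
for name, value in parts.items():
    print(f"{name}: {value:.2f}")
```

The three parts sum to the full-model R-squared. Crediting each set only with its uniqueness, and leaving the commonality unassigned, is what makes the estimates of separate effects minimum rather than maximum estimates.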

IDS Results 

Means and standard deviations are presented in Cooley and Leinhardt 
(1980). Here, we focus on process-outcome relations. One way to examine 
the overall effect of classroom processes is to examine the change (increase) 
in R² when process predictors are added to a model that already has pretest 
as a predictor. (The R² for a regression analysis indicates the proportion of 
outcome variation that is shared with, or can be "accounted for," by 
predictor variables.) Those results indicated that combined process effects 
were greater in grade 1 than in grade 3. In grade 1 reading, the R² value with 
the process variables in the model was 0.51, compared to 0.25 for the 
analysis of pretest and outcome. In grade 1 math, the R² increased from 0.15 
for pretest predicting posttest to 0.43 for pretest and process measures 
predicting posttest. These data indicate that the process measures (collectively) 
can account for as much or more outcome variation as the 
classroom's pretest standing. 

The regression results for third grade suggested less potent effects for 
process measures, although not zero effects. In grade 3 reading, the R² 
increased from 0.74 (pretest alone as predictor) to 0.81 when process 
measures were included, about a 10 percent increase in R². In grade 3 math, 
the R² went from 0.61 for pretest only to 0.73 for a model with pretest and 
classroom process variables, approximately a 20 percent increase in 
outcome variance accounted for. In the IDS third-grade data, the pre-with-post 
correlations were around 0.8 or higher; therefore, there was relatively less 
outcome variation that could potentially be attributed to process variables. 
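
The arithmetic behind these comparisons can be checked directly. The following is a minimal sketch that uses only the R² values reported above; the function name `r2_change` and the table layout are choices made here for illustration, and the "percent increase" is computed as a relative increase over the pretest-only R².

```python
# R-squared values as reported in the text: (pretest only, pretest + process).
REPORTED_R2 = {
    ("grade 1", "reading"): (0.25, 0.51),
    ("grade 1", "math"): (0.15, 0.43),
    ("grade 3", "reading"): (0.74, 0.81),
    ("grade 3", "math"): (0.61, 0.73),
}


def r2_change(r2_pretest, r2_full):
    """Return (absolute increment, relative increase in percent)."""
    increment = r2_full - r2_pretest
    relative_pct = 100.0 * increment / r2_pretest
    return round(increment, 2), round(relative_pct, 1)


if __name__ == "__main__":
    for (grade, subject), (r2_pre, r2_full) in REPORTED_R2.items():
        inc, rel = r2_change(r2_pre, r2_full)
        print(f"{grade} {subject}: +{inc:.2f} R-squared ({rel:.1f}% relative increase)")
```

Read this way, the grade 1 increments (0.26 and 0.28) are as large as or larger than the pretest-only values, while the grade 3 increments (0.07 and 0.12) correspond to the roughly 10 and 20 percent relative increases cited.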

Among the process variables, which specific indicators were most 
responsible for process covariation with outcomes? This question was 
addressed by the commonality analysis described above. Cooley and 
Leinhardt used raw gain as the outcome measure in the commonality 
analysis. One of the four process constructs, opportunity, correlated 
significantly positively with gain in all four contexts (reading and math, first 
and third grades). The opportunity variables included measures of the 
percent of students on task, the allocated instructional time, absenteeism, 
and curriculum-with-test overlap. The strongest individual correlations with 
gain were for measures of curriculum and test content overlap. IDS results 
did not suggest effects for class size: large classes were just as likely as 
small classes to have high gains. Also, measures of individualized 
instruction did not indicate that individualization methods were related to greater 
academic growth. 

Another interesting result was that the degree of pullout instruction 
was not related to achievement of students. Apparently, the nature and 
quality of the instruction was more important for learning than whether (and 
how many) students were pulled from their regular classroom. The authors 
summarize their findings by calling attention to the importance of direct 
instruction in curriculum areas that are tapped by the criterion test (with a 
call for informed test selection and use, so that test content reflects desired 
values). 

REVIEW OF RESEARCH IMPLICATIONS FOR 
INSTRUCTION OF CHAPTER 1 STUDENTS 

Brophy (1986) recently completed a paper for the Office of Educational 
Research and Improvement of the DOE. The focus was to synthesize the 
currently available research results with particular attention to the relevancy 
of those findings for Chapter 1 instruction. The paper ranges broadly and is 
quite extensive. This discussion will deal with specific teaching behaviors 
cited by Brophy and with the need for more research. 

Brophy begins by dealing explicitly with the issue of why the currently 
available research is adequate to inform policy makers about effective 
instruction for Chapter 1 students. This approach is somewhat unusual in that 
he did not assume that it would be self-evident that one could generalize 
earlier research findings to Chapter 1 classes. Brophy cites four reasons 
why he believes research in regular classroom instruction has implications 
for Chapter 1 instruction: (1) reviewers of research have suggested that, in 
some cases, the setting is less important than quality and quantity of 
instruction; (2) literature such as the aptitude-treatment interaction work has 
not clearly indicated that qualitatively distinct programs work best with 
certain types of students; (3) much of the regular classroom research (such 
as the Follow Through Program evaluations) was carried out in lower-SES 
schools; and (4) a portion of what little research was done in Chapter 1 
settings (e.g., the Oklahoma City study to be described later) agreed with 
results from regular classes. 

Brophy's four reasons are due some consideration. One could argue 
that the near absence of empirical process-product work in Chapter 1 classes 
is the underlying component of each of the four reasons. 

Although quality and quantity of instruction are surely important 
factors for effective teaching, they may be mediated by the context of the 
setting, such as small, homogeneous classes working with a teacher, an 
aide, and a wide variety of materials. In this conception, the context or 
setting could be more important for student learning than "main effects" for 
quality or quantity of instruction. For example, the same teaching strategy 
may be effective or not effective, depending on the setting (teacher contact 
of individual students fits this pattern). 

More aptitude-treatment interaction studies do need to be done in 
public school classes. Whereas Brophy is correct about the current paucity 
of evidence, there have been some suggestions of ways of teaching 
effectively that vary with socioeconomic status of students; the Brophy and 
Evertson (1976) second- and third-grade study is one example. In addition, 
the Crawford (1983) study in Chapter 1 classes showed differences in 
effective teaching according to the grade level of the students. Even ordinal 
interactions of teaching strategies and student characteristics can imply 
different effective profiles for different types of students. 

Even though much of the regular classroom research was done in 
lower-SES schools, that alone does not mean that findings can generalize to 
Chapter 1 programs. That question, the degree of generalization, depends on 
the strength of the Chapter 1 context as a mediator variable (and should be 
decided empirically). The results of the Oklahoma City studies suggest that 
context (as defined in that one program) can be a potent mediator. Brophy 
does include the following caveat near the end of his paper (while arguing in 
favor of policy implications of regular research for Chapter 1 in the early and 
middle portions of the paper): 

Most process-outcome findings were based on natural variation observed in 
traditionally taught classrooms, so that generalization of these data is limited to 
such classrooms (p. 60). (emphasis added) 

And although many of the findings of the Chapter 1 process-product 
research (to be described in detail following the review of the Brophy paper) 
did agree with earlier research, there were also many salient differences in 
indications of effective teaching in Chapter 1 and regular classes. The global 
or more "molar" variables (like time-on-task or amount of instruction) do 
replicate across contexts. The more molecular variables (use of individual 
contacts, duration of interactions, type of feedback given) may not replicate 
in different settings. 

It is easy to agree with Brophy that in the absence of a replicated 
empirical base generated from Chapter 1 classes, the next best thing would 
be reasoned extrapolation of regular classroom research results. One could 
also issue a strong call for more research done in the Chapter 1 setting. The 
results from that research would inform us as to the strength of the Chapter 1 
setting as a mediating context variable. 

Under the heading of quantitative findings, Brophy discusses instruc- 
tional dimensions concerning how teachers can increase the amount of 
student engagement with curriculum content. Student opportunity to learn, 
teacher expectations, engaged time, and active teaching are described as 
methods for fostering classroom gains. Emphasis is placed on teacher 
delivery of curriculum content (rather than reliance on worksheets, books, 
supplemental materials, etc.). 

Discussion of qualitative findings centered on process measures 
reflecting how the teacher delivers content and keeps students on task. Brophy 
used three categories in synthesizing the research regarding quality of 
instruction: giving information (structuring), asking questions (soliciting), 
and providing feedback (reacting). 

Giving Information (Structuring) 

Brophy cites research evidence that indicates support for teaching strategies 
that structure or organize the content in such a way as to facilitate student 
learning. Effective teachers employ advance organizers, use overviews, 
outline lesson content, signal transitions, call attention to main ideas, and 
provide for review. In addition, achievement is fostered when the content is 
sequenced logically and is presented with appropriate redundancy. Clarity 
of presentation, as indicated by continuity and precision of verbal 
explanations, has also been a consistent (positive) correlate of achievement 
(particularly when learning new or difficult material). Teacher enthusiasm 
probably relates more to student attitude than to learning, but when it has 
correlated with learning, the relations have been positive. 

Questioning Students (Soliciting) 

Brophy notes that studies of the optimal difficulty of questions have yielded 
mixed results. The probable reason for this finding is that the appropriate 
level of difficulty varies with setting or with type of student. Brophy 
suggests that the majority of teacher questions (he uses the figure 75 percent) 
should elicit correct responses. When conducting fast-paced drill activities, 
he maintains that error rates in student responding should be minimal. 
However, when dealing with difficult material or trying to get students to 
synthesize or generalize lesson content, a higher error rate would be 
appropriate. The Brophy and Evertson (1976) study found positive 
correlations for percent of questions answered correctly and gain in lower-SES 
classes, but negative correlations in their higher-SES classrooms. This 
pattern suggested that effective teachers in higher-SES classes challenged 
students with more difficult questions, whereas effective teachers of 
lower-SES students made greater use of "small steps," higher redundancy, and a 
high success rate when questioning. These findings will be contrasted with 
the Oklahoma City results of process-outcome analyses in Chapter 1 classes. 

Regarding the cognitive level of questions, Brophy again points to 
conflict in research results. Certainly not all questions should be 
higher-order (analysis, synthesis, etc.), since questions requiring lower-level skills 
like knowledge and comprehension have been shown to relate to student 
performance (even on higher-level objectives). Brophy concludes that 
teachers should take their lesson goals, curriculum content, and student 
progress into account when deciding on lower- versus higher-order 
questions. When questioning, effective teachers also made use of three to five 
seconds of "wait time," asking the question and then waiting before 
calling on a student to respond. 

In the early grades, especially in small-group settings, it is important to 
use methods that ensure equitable student involvement; some teachers 
accomplish this by using patterned turns for calling on students. In the upper 
grades, lesson continuity and pace may be more important than simply 
engaging all students overtly. Those upper-grade students may have more 
fully developed skills in learning by modeling (watching and listening to 
other students). 



Providing Feedback to Students (Reacting) 

If students respond correctly, effective feedback may consist of only an 
acknowledgment from the teacher. Use of praise following correct 
responses has been shown to have only low positive (and sometimes negative) 
relations with achievement. Ineffectiveness of too much praise may be due 
to its intrusiveness, or it may embarrass the student being praised 
(particularly older students), or it may not be "genuine" (if all or nearly all 
responses are praised). Brophy does recommend that teachers of Chapter 1 
students "probably should praise these students more often than other 
teachers praise their students . . ." (page 22). However, process-product 
data that came from Chapter 1 classes showed negative correlations between 
most measures of praise and achievement. This will be discussed in more 
detail in the next section. 

If a student responds incorrectly, the research has consistently 
indicated that teachers with high-gaining classes are more likely than others to 
sustain the interaction by rephrasing or repeating the question, or by 
providing clues to the student who answered incorrectly. Brophy points out 
that, in some situations, such sustaining feedback must be constrained so as 
not to amount to "pointless pumping." Students should probably not be 
allowed to remain silent or to mumble "I don't know" in response to a 
question (an incorrect response is better than no answer at all). 

When students offer questions and comments during a lesson, effective 
teachers will discourage such student initiations if they are not relevant to 
the lesson content. However, if the student verbalization is on-topic, then 
effective teachers use the student idea, incorporating it into the lesson 
in a smoothly flowing manner. In particular, as the grade level of the 
students increases, it appears to become more important to use and 
encourage topic-relevant student ideas. 

Following his synthesis of the teaching effectiveness results, Brophy 
discusses qualifications of findings due to interactions with three factors: the 
grade or developmental level of the students, the subject matter being 
taught, and student socioeconomic status or ability. Other areas dealt with in 
the paper are special education settings, cooperative learning, conceptual 
change teaching in science, the teaching of cognitive or "metacognitive" 
learning strategies, and mastery learning approaches. 



THE OKLAHOMA CITY STUDY: 
PROCESS-OUTCOME RELATIONS IN 
CHAPTER 1 CLASSES 

In the spring of the 1981-82 school year, process data were collected in the 
Chapter 1 classes in the Oklahoma City Public Schools. The Crawford (1983) 
document is the full technical report, and Crawford (in press) focuses on 
differences and similarities between the Chapter 1 results and 
process-product results from regular classes. 

Participants in the study were the seventy-nine Chapter 1 teachers in 
the district. Although the low inference coding system did record dyadic 
interactions between the teachers and individual students, individual student 
identity was not retained (teachers were the focus of the study and the unit 
of analysis). Forty-four of the teachers were in elementary grade classes and 
thirty-five were in fifth-year centers or middle schools. Fifth-year centers 
were created as part of the district's desegregation plan; these schools 
provide a transition year from elementary to middle school. 

Observers were trained to use the instruments reliably prior to data 
collection. The low inference system was an adapted version of the Brophy 
and Evertson (1976) dyadic interaction analysis system. The coding system 
recorded public response opportunities, private teacher-student contacts 
(whether initiated by the teacher or the student), behavior management 
interactions, and public student-initiated questions or comments. High 
inference measures were completed at the end of each observation. The 
more global, high inference ratings addressed areas such as the difficulty of 
seatwork, use of aides, overall attentiveness, and distribution of teacher 
contacts within the classroom. In addition, time allocation narrative was 
coded to yield indicators of on-task/off-task minutes and type of grouping 
patterns in use. Although the classes had paraprofessional aides assisting the 
teacher, there was little systematic analysis of the aides' activities (one high 
inference variable and anecdotal notes), since the teacher was the focus of 
the study. 

The data collection sampled three days of teaching between the first 
week of March and the first week of May. Depending on the grade level of 
the students, the length of an observation ranged from thirty to thirty-five 
minutes in the lower grades to fifty to fifty-five minutes in grades 6 through 
8. (More observations were done in the early grades, but the amount of time 
observing was the same across grades.) Data were obtained for 337 
observations and then aggregated to the teacher level. 

Student achievement scores were also aggregated to the teacher level, 
so the outcome measure reflects the average gain for a teacher's students. 
Therefore, correlations are assessing the covariation among (Chapter 1) 
teacher measures of process and achievement. The achievement data came 
from the California Achievement Test (CAT), total reading and total math 
subtests. Pretest data were taken from the spring 1981 districtwide testing, 
and the posttest measures were the end-of-year CAT scores from May of 
1982. 

The data analysis proceeded by running process-product correlations 
for the total sample, and for elementary (first through fourth grades) and 
upper grades (fifth through eighth) separately. The reading process data 
were correlated with reading outcomes, and math processes were related to 
math achievement. The teacher level gains entering the analyses included 
both raw achievement gains and regression-adjusted gains. 

Results of Process-Product Analysis 

Reading. Across all grade levels, there were several indications that 
active, task-oriented teaching was related to greater reading gain in Chapter 
1 classes. For example, the total number of minutes allocated for academic 
activity and the number of academic teacher-student interactions were both 
significantly positively correlated with gain. In addition, the number of 
nonacademic contacts per minute (a rate variable) was negatively correlated 
with gain. A variable that was calculated as the ratio of a teacher's academic 
rate of interacting to his or her nonacademic rate of interacting was a 
significant correlate of reading gain. 
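
The ratio variable described here can be sketched in a few lines. This is an illustration only: the function names, the example counts, and the handling of a teacher with no nonacademic contacts are assumptions made here, since the text does not say how the study coded that case.

```python
def interaction_rate(n_interactions, minutes_observed):
    """Interactions per minute of observation."""
    if minutes_observed <= 0:
        raise ValueError("minutes_observed must be positive")
    return n_interactions / minutes_observed


def academic_ratio(n_academic, n_nonacademic, minutes_observed):
    """Ratio of the academic interaction rate to the nonacademic rate.

    Returns None when no nonacademic contacts were observed, because the
    ratio is undefined in that case (an assumption; the study's rule is
    not stated in the text).
    """
    academic_rate = interaction_rate(n_academic, minutes_observed)
    nonacademic_rate = interaction_rate(n_nonacademic, minutes_observed)
    if nonacademic_rate == 0:
        return None
    return academic_rate / nonacademic_rate


if __name__ == "__main__":
    # Hypothetical teacher: 90 academic and 15 nonacademic contacts in 100 minutes.
    print(academic_ratio(90, 15, 100))
```

Because both rates share the same observation time, the ratio reduces to the ratio of the two contact counts; expressing it through rates simply mirrors the way the variable is described.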

Analysis of data from the full sample also indicated that a number of 
measures of the quality or nature of interactions were significant predictors 
of reading gain. The high inference rating of seatwork difficulty was 
positively related to gains, suggesting that teachers with high-gaining classes 
made relatively more use of challenging materials. Those same teachers 
were active monitors, circulating to students who needed help with their 
seatwork exercises. Across grades, results also suggested that teachers who 
asked relatively more opinion questions of Chapter 1 students tended to 
have higher gains. 

The proportion of all public response opportunities with teacher praise 
was correlated negatively with reading gain for the whole sample. The mean 
indicated that 8.6 percent of all response opportunities in reading were 
followed by praise. Teachers with rates less than this value tended to have 
higher gains. The results for criticism were mixed. In private (one-to-one) 
contacts when the teacher initiated the interaction, the correlations of 
criticism and gain were negative, whereas criticism following student- 
initiation of private contacts was positively related to gain. In any case, the 
means for criticism were very low (from 0.25 to about 1.2 percent of private 
contacts), so the positive correlation reflects the difference between 
teachers who never used critical feedback and those who used it very rarely. 
Virtually all of the observed critical feedback instances were work-related 
("Billy, you can do a lot better than this") and did not criticize the student 
as a person. And lastly, the overall analyses in reading suggested that the 
Chapter 1 teachers with the higher gains avoided redirecting a question to 
another student after the first student failed to answer correctly. Instead, 
they were more likely to sustain the interaction with the first student, 
rephrasing or giving clues to help improve the student's response. 

Analyses were also carried out within subsamples defined by grade 
level: elementary (grades 1 through 4) and upper grades (grade 5 and middle 
school grades 6 through 8). 

Elementary results in reading indicated that high-gaining teachers 
relied on individualized (teacher and one student at a time) instruction and 
use of small groups to cover academic content. Keep in mind that these are 
Chapter 1 pullout classes with a teacher and an aide and a full class size of 
six to eight students, an adult-to-child ratio of about one to four. As in the 
total sample, the elementary results support the profile of an active teacher 
(high numbers of academic contacts) who often sustains interactions when 
students respond incorrectly, and who uses praise sparingly in a public 
setting. Student-initiated questions in public were positively related to gain, 
whereas student comments were negatively correlated with reading gains in 
the elementary grades. 

The high inference rating of seatwork difficulty was based on indicators 
such as how much help the students requested when completing their 
seatwork assignments. In both the elementary and upper-grade subsamples, 
that high inference variable was positively related to reading gain, suggesting 
that the teachers with the higher-gaining students often challenged those 
students with relatively difficult seatwork. Those same high-gaining 
teachers, especially in the early grades, also tended to have relatively more 
private, one-to-one interactions with students. 

Upper-grade results showed some differences from elementary grades. 
High-gaining teachers in grades 5 through 8 made relatively more use of 
public question-and-answer sessions and were more likely to be spending 
time with the whole class, rather than with one student at a time. In most 
observed Chapter 1 contexts, the behavior management variables did not 
show strong or consistently meaningful relations with gains. Generally, the 
behavior-related measures showed little variance and had highly skewed 
distributions. However, in the upper-grade reading results, there was some 
indication of the effective teacher as a successful behavior manager: 
high-gaining classes were characterized by few interruptions and high ratings on 
momentum and "withitness." From Kounin's (1970) work, a "withit" 
teacher is one who is aware of behavior problems and communicates that 
awareness to the students. 

Math. Math results were also analyzed for the total sample, and for 
elementary and upper grades separately. In math, across grades, the number 
of minutes of small-group academic time and the total number of dyadic 
contacts were significantly positively related to gains. If the verbal 
interactions constitute the virtual curriculum of the class, then these teachers are 
covering content at a brisk pace. In math, as in reading, use of opinion 
questions was positively related to gain. When effective math teachers did 
contact students privately, it was more likely to be a brief interaction, 
suggesting relatively active monitoring. Private social interactions were 
negatively related to math gains, as was the use of a feedback strategy of 
redirecting a question to another student (when the first student answered 
incorrectly). The use of "very long" (more than thirty seconds continuously 
with one child) private contacts was also negatively related to math gains. 

In math in grades 1 through 4, there were negative relations with gain 
for public, social-procedural interactions, but positive correlations for 
student-initiated questions or comments. As in reading, public praise was 
negatively correlated with gain. The proportion of public response 
opportunities where the students answered incorrectly was positively related to 
math gains in the early grades. This suggests that teachers who ask more 
difficult questions tend to have higher-gaining classes. The mean for 
proportion incorrect (at the elementary level) was 0.23, with a standard 
deviation of 0.15. Therefore, in this Chapter 1 sample, the higher-gaining 
teachers may have been eliciting errors on more than one-fourth to one-third 
of the occasions. Again, the small class size and the ability to provide 
substantive and nonthreatening feedback may have helped produce this 
finding. Teacher criticism of student-initiated questions or comments was 
negatively related to math gains, suggesting a need for encouragement of 
student involvement in the lesson. 

In the upper grades in math, the total amount of time and the total 
number of academic minutes were positive predictors of gain. Also, the total 
amount of teacher praise was significantly negatively related to gain. Again, 
in the upper grades in math, the teachers who elicited relatively fewer 
correct responses to public questions tended to have higher gains. The mean 
for grades 5 through 8 on proportion of response opportunities coded as 
correct was 0.65, with a standard deviation of 0.25. Therefore, some 
upper-grade math teachers who had high-gaining students were getting more 
than 50 percent errors when questioning students publicly. These same 
analyses showed strong negative correlations for redirecting feedback, 
indicating that effective teachers sustained the interaction with a student 
when he or she did respond incorrectly. Teacher praise following student 
questions and comments was positively related to gain, whereas praise in 
other settings was negatively correlated with math gains. Infrequent use of 
critical feedback (fewer than 1 out of 100 interactions) was associated with 
math achievement gains in the upper grades. 
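
The error-rate statements in the two preceding paragraphs follow from the reported means and standard deviations. The sketch below works through that arithmetic under one loud assumption that is not in the source: that a "higher-gaining" teacher sits roughly one standard deviation from the mean in the direction of more errors. The function name `one_sd_band` is hypothetical.

```python
def one_sd_band(mean, sd):
    """Return (mean - sd, mean + sd), clipped to the 0..1 range of a proportion."""
    low = max(0.0, mean - sd)
    high = min(1.0, mean + sd)
    return round(low, 2), round(high, 2)


if __name__ == "__main__":
    # Elementary math: proportion of response opportunities answered incorrectly.
    elem_low, elem_high = one_sd_band(0.23, 0.15)
    print("elementary error proportion at +1 SD:", elem_high)

    # Upper-grade math: proportion answered correctly; errors are the complement.
    upper_low, upper_high = one_sd_band(0.65, 0.25)
    print("upper-grade error proportion at -1 SD correct:", round(1 - upper_low, 2))
```

Under that assumption, an elementary teacher one standard deviation above the mean would see errors on a little over one-third of response opportunities, and an upper-grade teacher one standard deviation below the mean on correct responses would see errors on more than half of them.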



SUMMARY OF OKLAHOMA CITY STUDY: 
SOME CONFIRMATIONS 
AND SOME SURPRISES 

Some of the findings of the Chapter 1 process-product study replicated 
earlier research in the regular classroom, but many of the results did not 
agree with regular classroom results. In some instances, the Chapter 1 
results differed from earlier research in lower-SES classrooms. If the 
differences in patterns of effective teaching behavior were to be replicated 
by other studies, then the evidence would indicate that the context of the 
Chapter 1 instruction is an important mediator. 

Areas of agreement between regular classroom research and the 
Chapter 1 findings included documentation of the effectiveness of active 
instruction and student engagement with content. The amount learned by 
students is, at least in part, a function of interactions in class regarding the 
lesson content. The Chapter 1 study analyzed an "academic ratio" variable 
that was the ratio of a teacher's academic rate of interacting (number of 
interactions per minute) to his or her nonacademic rate. That measure 
correlated positively with reading and math gains, across elementary and 
upper grades. Several studies done in regular classrooms (e.g., Stallings and 
Kaskowitz, 1974; Good and Grouws, 1979) found, as in Chapter 1 classes, 
that teachers who sustain interactions with students tend to have higher 
gains; also, allowing students to remain silent or say "I don't know" was 
negatively related to gains. 

Both the Chapter 1 study and earlier work by Evertson, Anderson, 
Anderson, and Brophy (1980) in regular classes indicated that classrooms of 
junior-high-age students with high gains were often taught by teachers 
concerned with rapport, as well as with academic achievement. For 
example, teacher acceptance and encouragement of student-initiated questions or 
comments and occasional use of brief social interactions were related to 
greater gains in fifth- to eighth-grade Chapter 1 classes. The Evertson junior 
high study also found that whole-class (public) discussions were effective 
during math lessons, and this finding also arose in the Chapter 1 results. 

Despite some areas of agreement with earlier regular classroom 
research, the Chapter 1 results also brought forth several notable 
disagreements. 

For example, there were indications in the Chapter 1 results that a 
degree of challenge in lesson content or in questions from the teacher was 
related to greater gain. In reading, ratings of seatwork difficulty correlated 
positively with gain, and in math, public questions that elicited relatively 
more student errors were associated with achievement growth. This finding 
seems in contradiction with the BTES results (Fisher et al., 1978) and agrees 
with the higher-SES subgroup of results from the Brophy and Evertson 
(1976) study. The setting of the Chapter 1 classes could account for the 
discrepancy in results. There may be less anxiety about public responding in 
Chapter 1 math classes, perhaps facilitating learning through errors in that 
situation. Also, the small class size makes immediate, nonthreatening 
feedback possible. Relatively greater difficulty in seatwork or in questions 
may be a part of a complex of behaviors by which teachers communicated 
high expectations to the Chapter 1 students. Those higher expectations 
could lead to greater student achievement. 

In addition, Burstein (1980) pointed out that the widely referenced 
Fisher et al. (1978) results regarding the need for a high success rate came 
from analyses with the student as the unit of analysis. When the student is 
the unit of analysis, in-class error rates constitute a proxy for ability. 
Therefore it makes sense that positive correlations with achievement would 
result (but have little to do with the teachers' instructional strategies). When 
the BTES data were reanalyzed with the teacher as the unit of analysis, 
negative correlations between gain and success rates emerged, which was in 
agreement with the current results. Although results are mixed, there are 
indications that teachers who make use of a relatively challenging style 
(with substantive and useful feedback) tend to have higher-gaining 
classes. 
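
The unit-of-analysis point can be made concrete with a small constructed example. The sketch below uses entirely hypothetical data (three teachers, three students each) chosen so that, within every class, students with higher success rates gain more, while across classes the teachers with lower average success rates have higher average gains. It is an illustration of the statistical argument, not a reanalysis of the BTES data; all names and numbers are invented here.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical (success_rate, gain) pairs for students, grouped by teacher.
CLASSES = {
    "teacher_a": [(0.3, 0), (0.6, 10), (0.9, 20)],
    "teacher_b": [(0.2, 2), (0.5, 12), (0.8, 22)],
    "teacher_c": [(0.1, 4), (0.4, 14), (0.7, 24)],
}


def student_level_r(classes):
    """Correlation with every student treated as one observation."""
    pairs = [pair for students in classes.values() for pair in students]
    return pearson_r([p[0] for p in pairs], [p[1] for p in pairs])


def teacher_level_r(classes):
    """Correlation of class means, one observation per teacher."""
    mean_success = [sum(s for s, _ in st) / len(st) for st in classes.values()]
    mean_gain = [sum(g for _, g in st) / len(st) for st in classes.values()]
    return pearson_r(mean_success, mean_gain)


if __name__ == "__main__":
    print("student as unit:", round(student_level_r(CLASSES), 2))
    print("teacher as unit:", round(teacher_level_r(CLASSES), 2))
```

With these invented numbers the student-level correlation is strongly positive and the teacher-level correlation is strongly negative, which is the pattern of sign reversal the paragraph describes.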

Another area of some disagreement with regular classroom research 
concerns the use of praise and criticism. The Chapter 1 results showed many 
negative correlations between measures of praise and gain, and some 
positive correlations between criticism and gain. The means of measures of 
praise were in the 10 to 15 percent range (10-15 percent of response 
opportunities), suggesting that teachers with lower-gaining classes may have 
been giving praise after every third or fourth question. Such overuse would 
diminish the effectiveness of praise as a reinforcer and could communicate 
lower expectations to students. The means on critical feedback were 
typically less than 1 percent of the interactions. Positive correlations of 
criticism and gain might be another expression of a teacher's challenging, 
high-expectations style of teaching (higher-gaining classes had very 
occasional use of work-related criticism). These Chapter 1 results agree only with 
the higher-SES findings of Brophy and Evertson (1976), but do agree with 
some of the results of Stallings and Kaskowitz (1974) from lower-SES 
Follow Through classrooms. 

One area of disagreement with earlier research concerns the effective- 
ness of private, one-to-one interactions between the teacher and individual 
students. Researchers such as Brophy and Stallings found negative 
correlations with gain, whereas the Chapter 1 results (especially in reading in 
the early grades) showed positive correlations. Reports of the earlier 
research included speculation that individualized instruction per se was not 
ineffective, but rather that the rest of the class (of perhaps twenty-five or 
more students) may not be engaged with lesson content while the teacher 
works with one student. In Chapter 1 lab-type classes, an adult-to-child ratio 
of about 1 to 4 probably permits more active teacher (and aide) monitoring of 
the remainder of the class as the teacher works with an individual student. 
The aide can assist in monitoring and helping the other students and the 
teacher can easily maintain awareness of what the entire class is doing. 



DISCUSSION 

Even within a single study (such as McDonald and Elias, 1976; Cooley and 
Leinhardt, 1980; Crawford, in press), results for process measures are 
qualified by grade level and subject matter. Many teaching behaviors are 
effective in reading, but not math (or vice versa), or show different patterns 
of correlation in different grade levels. So it should not be surprising that 
some differences across studies will emerge as research accrues. Brophy 
(1986) discussed differences in effective strategies across three mediating 
variables: subject matter, grade level, and ability/background of the student. 
A fourth could be added for the setting or context of the instruction. Chapter 
I pullout classes look different from regular classrooms in terms of number 
of classmates, materials and equipment available, and access to the teacher. 
Additional studies need to be done in order to determine fully the nature of 
the effects of classroom setting. Other groupings of classes could be 
analyzed: special education, bilingual, gifted/talented, and fine arts classes 
could form the focus for studies of each particular setting. 

Nevertheless, we do have the beginnings of research-based knowledge 
about programs that work in compensatory education settings. By pulling 
together information from the DOE identification of exemplary sites with 
regular classroom research and Chapter I process-product results, it is 
possible to see several consistent themes regarding impact on the achievement 
of educationally disadvantaged students. Figure 9-1 shows the major 
categories of variables. This framework is adapted from Centra and Potter 
(1980). 

School District or Among-School Conditions 

Instructional leadership is a primary factor operating among schools. 
Typically this has been represented as the effect of the principal and his or 
her instructional focus. In the context of Chapter 1 instruction, the program 
director may fill the leadership role. In addition, physical facilities, books, 
quantity of schooling, and curriculum focus are candidates for important 
among-school factors. 

FIGURE 9-1. Model of School and Classroom Effects in Compensatory Education 



The most potent impact can result from extension of learning into the home, 
with the parent as teacher. Other community involvement activities may 
contribute to a positive climate and therefore indirectly enhance school 
effectiveness: PTA activity, adopt-a-school programs, and parent volunteers 
in classrooms are examples. 



In the context of effective Chapter I programs, coordination of the remedial 
and regular classroom instruction is of prime importance. In addition, a 
positive climate has been shown to relate to achievement in the school 
effects research. Schools where professional development is fostered also 
tend to be more effective than others. The use of evaluation results and clear 
goals and objectives for the school are also successful attributes. These 
within-school factors may be partly outside of the control of Chapter 1 
programs, but do contribute to the effectiveness of support programs like 
Chapter I. 



Teaching activities that occur within the compensatory education setting are 
proposed as having a direct impact on student in-class behaviors (which, in 
turn, no doubt have some reciprocal effect on those teaching strategies). In 
Chapter I classes, active instruction where the teacher transmits the content 
is an effective approach. Effective Chapter I teachers are busy monitors of 
student learning, and they display high expectations of students through a 
concerted pattern of actions (such as assignment of relatively difficult 
seatwork, asking questions that cause some student mistakes, and focusing 
of student attention on the academic content to be learned). Effective 
programs may recognize excellence in student performance through sym- 
bolic or tangible reward systems. During classroom instruction, effective 
Chapter I teachers use praise judiciously (sparingly, only when it is truly 
called for) and may make very occasional use of work-related critical 
feedback. Primarily in reading in the early grades, effective teachers in small 
pullout classes made use of individual contacts (one-to-one interactions) 
with students. In math, content coverage and maintenance of a brisk pace 
seemed more important than individual contacts. Teachers with high gains in 
reading and in math were more likely to sustain an interaction with a student 
if he or she answered incorrectly (instead of redirecting the question to 
another student). Encouragement of student involvement in the lesson was 
more important as grade level increased. Effective teachers also were 
successful managers who found it necessary to devote very little class time 
to behavior problems. 



Student Behaviors 

Students who are actively engaged with lesson content are more likely to 
learn that content. Further, opportunity to learn is affected by the manner 
in which teachers allocate class time. The content overlap between the classroom 
curriculum and the criterion test is of considerable importance (and criterion 
measures should be carefully selected based on desired values of schooling). 
And when students learn how to learn, through metacognitive strategies, 
greater achievement can result. 



Student Characteristics 

The background characteristics of students do affect student learning; 
these factors are exogenous to the model in the sense of not being directly 
affected by school/classroom factors. Prior characteristics like SES and 
previous achievement level have a direct impact on students' learning. 
However, several studies have indicated that, particularly in the early 
grades, instructional factors may have an impact as great or greater than 
prior student characteristics. With appropriate instruction, educationally 
disadvantaged students may gain 15 to 20 (national) percentile points in basic 
skills achievement over the course of one year. 



Student Learning 

Exemplary Chapter 1 programs provide evidence of student achievement in 
basic skills, sustained gains over time, and increases in student attitude and 
affective outcomes that were related to program activities. Research has 
indicated strongest effects for compensatory education in the early grades 
(e.g., see the nationwide study of compensatory education by Carter, 1984). 

We have the beginnings of a research base regarding effective practices 
in Chapter 1 instruction. With more studies, it should be possible to observe 
further gains in knowledge about successful compensatory education. 

EFFECTIVE STRATEGIES 
FOR ACADEMICALLY 
HANDICAPPED STUDENTS 
IN THE REGULAR CLASSROOM 

Barbara Larrivee 

California State University at San Bernardino 



During the 1960s and 1970s, the merit of special class placement for 
handicapped learners came under attack as mounting evidence challenged 
its positive effect on the achievement and social status of these students. 
Researchers consistently found that, at best, these special classes brought 
about academic performance no better than regular class placement (Budoff 
and Gottlieb, 1976; Calhoun and Elliott, 1977; Carroll, 1967; Goldstein, 
Moss, and Jordan, 1965; Johnson, 1962; Ritter, 1978; Vacc, 1972). Thus the 
enactment of the Education for All Handicapped Children Act of 1975 
marked the culmination of more than a decade's efforts to promote regular 
class placement for handicapped students. 

However, having lived with mainstreaming for the past decade, 
educators are now questioning its effectiveness in achieving the goals 
originally intended. One of the major goals of mainstreaming is the enhancement 
of social acceptability of handicapped students by their nonhandicapped 
classmates, yet the outcomes of mainstreaming on this variable have 
proved disappointing (Asher and Taylor, 1981; Gottlieb, 1981; Gresham, 
1981, 1983; Levine, Hummel, and Salzer, 1982). It appears that mainstreaming 
per se has had little impact on the social status of handicapped students. 
Likewise, handicapped learners educated in the mainstream have not made 
the academic progress expected as a result of integration and continue to fall 
further and further behind. The mere physical integration of handicapped 
learners with their nonhandicapped peers is clearly insufficient to bring 
about increased academic progress in the absence of carefully designed 
instructional strategies to accommodate low achievers. Thus the implementation 
of mainstreaming is presently occurring in the absence of an empirical 
data base warranting its continuation as currently practiced. 

Just as mainstream education is coming under attack as an environment 
historically unresponsive to the needs of low-achieving students, 
traditional special education practices are likewise suspect. For example, 
there is now general acceptance among special educators that little empirical 
support has accrued to warrant continued endorsement of the contention 
that perceptual, psycholinguistic, or motoric training or remediation is a 
necessary prerequisite to the attainment of basic academic skills or a 
meaningful educational goal for handicapped learners in and of itself (e.g., 
Larrivee, 1981a; Meyers, 1980; Tarver and Dawson, 1978). 

Similarly, although intuitively attractive to special educators, a priori 
matching of teaching approaches to particular student characteristics identified 
through assessment finds little support in the research literature. 
Despite many efforts over the past two decades to identify aptitude-treatment 
interactions (ATIs), none have been substantiated to the degree 
required for direct application to differential instructional treatment of 
students (e.g., Cronbach and Snow, 1976; Lloyd, 1984; Zigmond and Miller, 
1986). 

Although research in the area of identification of students with learning 
disabilities has been exhaustive, the result of this massive effort has been 
failure to disclose reliable methods of distinguishing the learning disabled 
from other low achievers (Moran, 1984; Wilson, 1985). In fact, the data 
serve to continually document overlap in identification profiles, actual 
classroom performance, and demographic factors (Warner, Alley, Schumaker, 
Deshler, and Clark, 1980; Ysseldyke, Algozzine, Shinn, and McGue, 
1982). This finding has particular significance given the alarming increase in 
the number of students being identified as learning disabled (Algozzine and 
Korinek, 1985). Thus there is little research basis for offering instructional 
programs to learning-disabled children that are different from those for other 
groups experiencing academic difficulty, such as the educable mentally 
retarded, emotionally disturbed, or those eligible for Chapter I remedial 
services. In fact, Gardner (1982) concluded that a special education 
student's categorical label is largely irrelevant for instructional purposes 
since there is no apparent interaction between diagnosis and teaching 
program. Further, recent research has shown that students in special 
education resource rooms with diagnosed reading problems may actually 
receive less overall instruction in reading than regular students, leading 
Haynes and Jenkins (1986) to conclude that there is no obvious relationship 
in practice between student need and amount of assigned resource room 
time or actual reading instruction. Also, Leinhardt, Zigmond, and Cooley 
(1981) found that in the self-contained learning disabilities classrooms they 
studied, teachers spent only an average of sixteen minutes per day actually 
teaching reading. 

Thus it appears that the very core of "special education" is being 
challenged on several of its fundamental assumptions, such 
as the need for specialized instructional programs aimed at remediation of 
perceptual, psycholinguistic, and motoric deficits; the need to match 
student-assessed needs with differential instructional treatments (ATI 
model); the utility of separate special education classifications for instructional 
purposes; as well as the overall quality of instructional practices in 
special education settings. 

As educators begin to question the appropriateness of both special and 
regular education instructional practices for students experiencing learning 
difficulties, a revamping of ongoing instruction programs to make them more 
responsive to the growing population of children experiencing failure in our 
schools is needed. This chapter will review what we now know about 
effective teacher behaviors and classroom practices for students with 
learning problems who are mainstreamed in regular classrooms for some or 
all of their academic subjects. 



TEACHER EFFECTIVENESS RESEARCH 

The emerging teacher effectiveness research base has come to the forefront 
as a possible means of increasing the effectiveness of school programs, 
especially in the area of basic skill acquisition. This body of literature has 
identified a core of specific teaching practices that appear to be reasonably 
stable across basic skill areas (e.g., reading as well as math), affective as 
well as academic outcomes, and across many types of students (at least at 
the elementary grade levels). 

Since teaching practices designed to benefit mainstreamed students 
must also be beneficial for their nonhandicapped peers for such practices to 
be advocated for use in integrated settings, traditional special education 
modes of research (e.g., treatment/effect studies, one-to-one tutorial techniques, 
case studies) are likely to have limited application to the mainstreamed 
situation in which the special child is educated with normal 
learners in large-class settings. 

A line of research that has rarely been utilized in special education 
research and that has important implications for the mainstreamed child is 
process-product research, that is, research conducted in classrooms for the 
purpose of identifying effective teaching behaviors. In this type of research, 
teaching behaviors are correlated with student performance measures, 
generally academic achievement, to identify significant relationships. Re- 
searchers engaged in this research over the past two decades have found that 
certain teaching behaviors consistently correlate with student achievement. 
Though much of this research has been conducted in classrooms serving 
disadvantaged students and/or low achievers, research has not been conducted 
specifically to examine the effect on mildly handicapped students, 
such as students diagnosed as learning disabled, educable mentally retarded, 
or behaviorally disordered, and receiving the majority of their education in a 
mainstreamed setting. A study by this author (Larrivee, 1985) was designed 
to extend the process-product research findings to the mainstreamed population. 
This research is summarized in the following section. The underlying 
premise upon which the research was based is that many of the effective 
teaching behaviors identified via process-product research in classrooms 
should also be appropriate for dealing with mainstreamed students placed in 
regular classrooms. 



TEACHER EFFECTIVENESS IN THE 
MAINSTREAMED CLASSROOM 

Research Plan 

The procedural plan for the first phase of the research called for develop- 
ment of a comprehensive list of teaching variables for consideration based 
on the teacher effectiveness literature pertaining to regular elementary 
classrooms, selection of teachers with a proven record of success with 
mainstreamed students, and, through observation and other assessment 
procedures, identification of common characteristics of teachers who are 
effective with mainstreamed students. 

The second phase was a large-scale correlational study to examine the 
relationship between specific teaching behaviors identified in the first phase 
and various aspects of mainstreamed student performance. This phase was 
intended to validate the teaching behaviors previously identified as effective 
for mainstreaming derived from the study of successful teachers. 

Of particular relevance for selecting variables for consideration in this 
study were those teaching variables identified as especially effective for 
low-SES and/or low-ability students. Several reviews of the considerable 
accumulation of process-product research by Medley (1977), 
Brophy (1979), Gage (1978), and Rosenshine (1979) provided the primary 
basis for establishing an initial set of teaching behaviors to be studied. 

These comprehensive reviews suggest a tentative picture of the teacher 
who is effective in elementary classrooms with low-achieving students. 
Given modest losses of precision, a descriptive profile of such a teacher will 
be attempted. The teacher who is especially effective in this type of 
classroom engages students in more lesson-related activities than the less 
effective teacher does, with more class time devoted to task-related or 
academic activities, more structure provided, more student-teacher interactions 
related to lesson content, and less time in which students are 
[...]. The effective teacher spends more time in group instruction, 
making the amount of time students spend working independently or in small 
groups less in the effective teacher's classroom. 

In terms of questioning strategies, the effective teacher is most likely to 
ask a question and then choose a respondent who has not indicated a desire 
to answer. The questions asked are predominantly factual and elicit responses 
that are usually correct, seldom meriting further discussion. This 
teacher is likely to raise a question first and then to indicate who is to answer 
it, possibly as a way of holding students' attention. The student chosen to 
respond is likely to get the right answer, since these teachers ask more 
appropriate questions and have a better sense of the difficulty level their 
students can handle. Criticism of a pupil's answer is rare, even when it is 
incorrect. If a student fails to answer or does not know the answer, this 
teacher is more likely to help out by rewording the question or asking an 
easier one than to turn to another student. The ratio of student-initiated to 
teacher-initiated questions and comments also tends to be lower for the 
effective teacher. 

In the area of classroom management, there is less deviant or dis- 
ruptive student behavior in classes taught by the effective teacher. Teacher 
rebukes are less frequent and less time is devoted to managing the 
classroom. The successful teacher controls the class with less criticism and 
uses a more varied repertory of techniques in doing so. This teacher also 
uses more praise or positive motivation. Permissive behavior, or giving 
students freedom to govern their own activities, is consistently found to be 
less common in the effective teacher's class. In summary, the effective 
teacher maintains an atmosphere that is supportive and relatively free from 
disruptive student behavior. This environment is sustained with little 
obvious effort and without expression of negative affect. Finally, the teacher 
who is effective with lower-ability students spends more time assisting 
individual students and checking their individual work, and is more likely to 
have been the initiator of the contact. When the effective teacher responds 
to questions related to content, he or she tends to interact longer and pay 
closer attention to the student than the less effective teacher. 

Although the emphasis of the research design in the Larrivee (1985) 
study was on teacher variables deemed to be important for the academic 
success of lower-ability students in regular classrooms, the design was 
supplemented by inclusion of some variables from the affective domain 
believed to be of particular concern for the special needs child, such as 
teacher responsiveness, relationship with students, and awareness of feel- 
ings. Also incorporated into the overall design were variables derived from 
advocated special education practices, such as providing opportunities for 
success and a supportive and highly structured environment. Academic 
learning time (ALT) was also included as an indication of actual opportunity 
to learn for the mainstreamed child. Lastly, a series of variables was 
included to explore specific intervention strategies directed at particular 
student behaviors characteristic of special needs students, as were several 
teacher attitudinal variables, since much research is available to support the 
importance of a favorable attitude toward mainstreaming handicapped 
children (e.g., Harasymiw and Horne, 1976; Higgs, 1975; Horne, 1979; 
Larrivee, 1981b). 

Ultimately, the final list of variables to be considered in the design 
consisted of sixty-eight teaching variables and six attitudinal variables. 
These variables encompass the domains of questioning strategies, teaching 
style, affective classroom climate, classroom management, academic 
learning time, and individualization. 

During the first phase of the research, the operational plan called for 
the selection of regular classroom teachers who had demonstrated their 
effectiveness with special needs students placed in the regular classroom 
setting. Teachers were selected based on the performance of these students 
who were assessed on a pre-post basis on academic, behavioral, social, and 
attitudinal variables. Since all children in schools are affected by main- 
streaming, successful mainstreaming practices would necessarily have to be 
effective for the nonhandicapped child as well. In fact, one would hope that 
increased attention to individual student needs and greater individualization 
of instruction would eventually result in improvement in the quality of 
education for all children. Hence, our definition of teacher effectiveness was 
influenced by the belief that teachers identified as effective for this study 
should be effective not only with mildly handicapped learners placed in their 
classrooms but also with their nonhandicapped classroom peers. Teacher 
effectiveness was further operationalized to mean producing greater progress 
than would normally be expected over the duration of the school year 
on several dimensions deemed as important educational outcomes for both 
handicapped and nonhandicapped students. This definition required that 
gains on each of the student performance measures, as well as gain on an 
overall profile score, be averaged over all children in all classes to provide a 
standard of comparison. Similar averages among the special needs children 
in all classes were determined to provide a meaningful comparison for these 
children in particular. 
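
The operational definition above can be illustrated with a minimal sketch. It assumes a single overall profile gain score per child (the study also compared gains on each separate performance measure), and the class labels and gain values are hypothetical, not data from the study. A class is flagged only when its mean gain exceeds the average over all children in all classes and its special needs students exceed the average over all special needs children.

```python
from statistics import mean

# Hypothetical gain scores on an overall profile score, grouped by class.
# "regular" and "special" each hold one gain per child (illustrative values only).
classes = {
    "A": {"regular": [10, 12, 14], "special": [9, 11]},
    "B": {"regular": [6, 8, 7], "special": [3, 5]},
    "C": {"regular": [9, 11, 10], "special": [12, 8]},
}


def effective_teachers(classes):
    """Return the classes whose mean gains exceed both comparison standards."""
    # Standard 1: gain averaged over all children in all classes.
    overall = mean(
        g for c in classes.values() for g in c["regular"] + c["special"]
    )
    # Standard 2: gain averaged over the special needs children in all classes.
    special = mean(g for c in classes.values() for g in c["special"])
    return sorted(
        name
        for name, c in classes.items()
        if mean(c["regular"] + c["special"]) > overall
        and mean(c["special"]) > special
    )


print(effective_teachers(classes))
```

Requiring both conditions reflects the stated belief that an effective teacher should produce above-average progress for the class at large as well as for the special needs students in it.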

Considering gains made by the special needs students as well as gains 
made by the class at large, twelve teachers were identified as outstandingly 
effective from an original data base of thirty-three elementary classrooms. 
Subsequently, more than twenty classroom observations were conducted in 
each of the classrooms of the identified teachers in an effort to isolate 
characteristic teaching behaviors. Ultimately, forty-two of the teaching 
variables considered were determined to be characteristic of teachers who 
are effective with mainstreamed students, as were an additional four 
attitudinal variables. These variables are designated in Figure 10-1 by one 
asterisk (*), indicating that the effective teachers engaged in a high amount 
of the behavior, and by two asterisks (**), indicating a low amount of the 
behavior to be characteristic of the effective teachers. Variables without 
asterisks were not found to be significantly correlated with performance 
gains for mainstreamed students. (For detailed descriptive information on 
this phase of the study, see Larrivee, 1980, 1982, 1985, and Larrivee and 
Vacca, 1980.) 



Analysis and Results 

The design, methodology, analysis, and results of the final phase of the study 
have been presented in detail elsewhere (see Larrivee, 1985) and will only be 
summarized here for the purpose of discussing potential ramifications of the 
research findings. 

The large-scale correlational study was based on a sample of 118 
classroom teachers and selected mainstreamed students. The sampled 
mainstreamed students had a primary classification of learning disabled but 
also included behaviorally disordered and speech and hearing impaired 
mainstreamed students. The design incorporated teacher characteristics 
expected to affect performance with mainstreamed students such as highest 
degree earned, number of courses in special education, years of teaching 
experience, experience with mainstreamed students, perceived success with 
mainstreamed students, and attitude toward the practice of mainstreaming 
special needs students. 

For the mainstreamed students, data pertaining to age, sex, prior 
achievement, type of handicapping condition, length of time mainstreamed, 
and amount of time spent with specialists were collected. Additionally, 
grade level, class size, proportion of mainstreamed students in the class, and 
number of reading groups were included in the design as well as school-level 
variables such as administrative support and availability of supportive 
services. The following performance measures for the selected mainstreamed 
students were considered: reading achievement, social status, 
self-perception of peer acceptance, overall maladaptive behavior as well as 
eleven individual behavior scores, off-task rate, difficulty level of learning 
tasks, student and teacher transition time, waiting-for-help time, and aca- 
demic learning time (ALT), defined as the time a student spends actively 
engaged in a content-related task that is of appropriate difficulty. 
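
As a rough illustration of the ALT definition, the sketch below sums only those observed minutes that meet all three conditions: the student is engaged, the task is content related, and the difficulty is appropriate. The interval coding, the difficulty labels, and the choice of which levels count as appropriate are assumptions made for the example and are not the study's observation instrument.

```python
from dataclasses import dataclass


@dataclass
class Interval:
    minutes: float
    engaged: bool          # student is actively on task
    content_related: bool  # task relates to lesson content
    difficulty: str        # hypothetical coding: "easy", "medium", or "hard"


def academic_learning_time(intervals, appropriate=("easy", "medium")):
    """Sum minutes that are engaged, content related, and appropriately difficult."""
    return sum(
        i.minutes
        for i in intervals
        if i.engaged and i.content_related and i.difficulty in appropriate
    )


# Hypothetical observation record for one student during one lesson.
observed = [
    Interval(10, True, True, "easy"),
    Interval(5, False, True, "easy"),   # off task
    Interval(8, True, True, "hard"),    # engaged, but the task is too difficult
    Interval(4, True, False, "easy"),   # transition, not content related
    Interval(6, True, True, "medium"),
]

print(academic_learning_time(observed))
```

Only the first and last intervals satisfy all three conditions, so a student can be busy for most of a lesson and still accumulate little ALT.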

Thus this massive data set included twenty-one dependent variables 
assessed for selected mainstreamed students and forty-eight independent 
variables, including twenty-seven classroom observation variables, each 
coded on four occasions for 118 classroom teachers. 

FIGURE 10-1. Teaching Variables 

QUESTIONING STYLE 
     Volunteer Respondent 
     Student Selection 
     Narrow Questions 
  *  Positive Feedback 
  *  Sustaining Feedback 
  *  Content Questions 
  *  Low-Order Questions 
  *  Correct Student Response 
     Criticism of Response 

CLASSROOM CLIMATE 
     Movement-Free vs. Restricted 
     Affective Environment 
     Physical Environment 
     Noise Level Appropriateness 
     Non-Permissiveness 
     Controlling Behavior 
     Acceptance of Feelings 
     Awareness of Feelings 
  *  Warmth 
  *  Teacher Responsiveness 
  *  Teacher Fairness 
  *  Performance Expectation 
  *  Relationship with Students 
  *  Initiation of Student Contact 

INDIVIDUALIZATION 
     Time in Small Groups 
     Time in Large Groups 
     Teacher Time with Individuals 
     Individualization of Work 
     Grouping for Math 
     Checking Student Work 
  *  Ad Hoc Grouping 
  *  Instructional Appropriateness 
  *  Grouping for Reading 
  *  Attention to Individual Needs 

CLASSROOM MANAGEMENT 
     Supportive Response to Conduct Problems 
     Supportive Response to High Severity Behavior 
     Teacher Consistency 
     Use of Praise 
  *  Supportive Response to Learning Problems 
  *  Supportive Response to Personality Problems 
  *  Supportive Response to Low Ability Students 
  *  Effective Use of Time 
  *  Total Supportive Response 
  *  Task Engagement Feedback 
  *  Variety of Interventions 
     Need for Discipline 
  ** Total Punitive Response 
  ** Punitive Intervention 
  ** Incidence of Intervention 

ACADEMIC LEARNING TIME 
     Allotted Time 
     Teacher Directed Time 
     Student Directed Time 
  *  Easy Difficulty Level 
  *  Engagement Rate 
  *  Academic Learning Time 
     Special Individual Work Time 
  ** Unassigned Time 
  ** Teacher Transition Time 
  ** Student Transition Time 
     Waiting-for-Help Time 
  ** Off-Task Time 
     Hard Difficulty Level 

TEACHING STYLE 
     Assignment of Tasks 
     Assignment of Homework 
  *  Teacher Flexibility 
  *  Lesson Structure 
  *  Clarity 
  *  Academic Feedback 
  *  Active Involvement 

OPINION AND ATTITUDINAL VARIABLES 
     Situational Job Satisfaction 
     Educational Philosophy 
  *  Positive Attitude Toward Mainstreaming 
  *  Professional Job Satisfaction 
  *  Scope of Professional Responsibility 
  *  Teacher Self-Perception of Competence 

* High amount characteristic of effective teachers. 
** Low amount characteristic of effective teachers. 

Of the forty-two teaching behaviors identified as characteristic of 
teachers effective with mainstreamed students, thirty-four were included in 
this phase of the study. To determine the degree of relationship between 
each teaching behavior and the student performance measures, a series of 
partial correlation analyses was conducted. Grade placement, sex, length of 
time mainstreamed, and hours of services provided were selected as control 
variables. Prereading achievement was controlled when postreading was 
considered. The results to follow are, of course, limited by the fact that the 
data are only correlational, and thus no causal inferences can be drawn from 
the data. An alternative plausible explanation is that higher-achieving, more 
well-behaved classes were observed and the selected covariates undercontrolled 
for these factors. 
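
For readers unfamiliar with the technique, a partial correlation is the correlation between two measures after the linear influence of one or more control variables has been removed from both. The sketch below uses the standard recursive formula and is offered only as an illustration under stated assumptions: the function names and the four-student data set are hypothetical, and the study's actual computations are not described in this chapter.

```python
from math import sqrt


def pearson(x, y):
    """Ordinary product-moment correlation between two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def partial_corr(x, y, controls):
    """Correlation of x and y with every series in `controls` held constant."""
    if not controls:
        return pearson(x, y)
    z, rest = controls[0], controls[1:]
    # Remove one control at a time, conditioning on the remaining controls.
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical data: a teaching behavior score, postreading, and prereading.
feedback = [2, 1, 2, 5]
post_reading = [52, 49, 56, 53]
pre_reading = [41, 42, 43, 44]

print(round(pearson(feedback, post_reading), 2))
print(round(abs(partial_corr(feedback, post_reading, [pre_reading])), 2))
```

In this constructed example the raw correlation between the teaching behavior and postreading is about one third, yet it vanishes once prereading is held constant, because both measures depend on prior achievement. This is the kind of spurious association the control variables are meant to guard against, and it is also why undercontrolled covariates remain a plausible alternative explanation.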

It is of interest to note that no significant correlations were found 
between the number and/or ratio of mainstreamed students within a class 
and any of the teaching behaviors or any of the mainstreamed student 
measures. Apparently the number of mainstreamed students in a class did 
not in itself influence teaching behaviors or mainstreamed student achievement. 
These results, coupled with the earlier finding that those teachers who 
were successful with their special-needs students were able to accomplish 
these gains while simultaneously achieving similar gains for all their 
students, serve to challenge the generally accepted notion that the extra time 
required to educate mildly handicapped learners will be to the detriment of 
the rest of their classmates. 

The only student measure that did not correlate significantly with any 
of the teaching behaviors examined was self-perception of peer acceptance. 
These results could be interpreted to mean that mainstreamed students' 
perceptions of their acceptance are unrelated to specific teaching behaviors 
and are more influenced by other factors. 

Due to the magnitude of the data set, a large number of significant 
relationships between teaching variables and mainstrcamcd student perfor- 
mance measures were identified from the analyses. In an effort to summarize 
the results, Table 10-1 has been organized to include the four categories of 
performance measures. Included here arc all teaching variables that had a 
minimum of three meaningful significant correlations with the mainstreamed
student performance measures.

TABLE 10-1. Significant Correlates of Mainstreamed Student Measures

Columns (Mainstreamed Student Performance): Reading achievement; Social
acceptance; ALT variable*; Behavior variable'

Teacher Variable

Sustaining Feedback
Positive Feedback
Easy Difficulty Level
Correct Student Response

Student Transition Time
Off-Task Rate
Incidence of Intervention
Punitive Intervention
Criticism of Response
Total Punitive Response
Supportive Response to Low-Ability Students
Supportive Response to Learning Problems
Total Supportive Response

Need for Discipline
Efficient Use of Time

Reading achievement and social acceptance

X(-)
X(-)

X(-)

X(-)
X

ALT variable*

ALT

ALT, EDL, OTR(-)
EDL, ALT
ALT, EDL, STM

ALT(-), OTR
ALT(-), TT
ALT(-), OTR, TT
ALT(-), EDL(-), TT

OTR(-)

OTR(-)

ALT(-), OTR
ALT, OTR(-)

Behavior variable'

CI
IW(-)

T(-), CD(-), DD(-), IR(-), EB(-), IW(-), C
CD
ER
ER

T, LCD, IW, IR, C(-)
T(-), CD(-), I(-), EB(-), AA(-), IW(-), IR(-), C
CD(-), IW(-)

T(-), CD(-), IW(-), I(-), IR(-), C

* ALT = Academic Learning Time; OTR = Off-Task Rate; EDL = Easy Difficulty
Level; ST = Student Transition; TT = Teacher Transition; WHT =
Waiting-for-Help Time.

Since a variety of outcomes are desirable for
handicapped students placed in regular classrooms, it was decided to
consider only those teaching behaviors correlating with at least three
measures of mainstreamed students' classroom performance as effective
teaching behaviors for mainstreamed settings. The single exception to this
criterion was the variable labeled "criticism of response," correlating
significantly with only one of the performance measures: peer acceptance
of the mainstreamed student. This variable was retained based on the
relative importance of social outcomes for mainstreamed handicapped
students.
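
The selection rule described above can be stated compactly. The Python sketch
below is illustrative only: the function name `select_behaviors`, its
parameters, and the sample data are assumptions introduced here, and the
sample entries are hypothetical rather than the values of Table 10-1. It
assumes that each teaching variable is paired with the set of student
measures with which it correlated significantly.

```python
def select_behaviors(correlates, min_measures=3, retained=()):
    """Keep variables with at least `min_measures` significant correlates,
    plus any variables explicitly named in `retained`."""
    return [
        name
        for name, measures in correlates.items()
        if len(measures) >= min_measures or name in retained
    ]


# Hypothetical data for illustration; not the values of Table 10-1.
correlates = {
    "variable_a": {"reading", "ALT", "OTR"},
    "variable_b": {"ALT"},
    "criticism_of_response": {"peer_acceptance"},
}

# "criticism_of_response" is retained as the single stated exception.
selected = select_behaviors(correlates, retained={"criticism_of_response"})
print(selected)  # ['variable_a', 'criticism_of_response']
```

With these hypothetical inputs, `variable_b` is dropped because it has a
single correlate, while `criticism_of_response` is kept only because it is
named as an exception.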

Ultimately, a total of fifteen teaching behaviors or strategies emerged
based on their relationship to several desired outcomes for mainstreamed
students. These behaviors are as follows:

1. Frequent positive feedback to students 

2. Providing sustaining feedback to students responding incorrectly to 
questions

3. Supportive, encouraging response to students in general 

4. Supportive response to low-ability students in particular 

5. Supportive response to problem behaviors indicative of a learning 
problem (as distinguished from conduct problems) 

6. Asking questions that students answer correctly 

7. Providing learning tasks that students can accomplish with a high rate of 
success 

8. Using classroom time efficiently 

9. Low incidence of teacher intervention 

10. Infrequent need to discipline students 

11. Limited use of punitive interventions 

12. Minimal punitive response to students 

13. Rare criticism of student responses 

14. Little student transition or noninstructional time 

15. Low rate of student off-task time 

Discussion 

The descriptive profile that emerges for teachers who are effective with
mainstreamed students is remarkably similar to that of teachers who are
effective for all students, based on an emerging body of teacher
effectiveness literature (e.g., Brophy and Good, 1986). The teaching
behaviors identified here as effective relative to questioning strategies
(i.e., giving positive feedback for correct student responses; using
sustaining feedback or attempting to prompt, rephrase, or provide clues for
incorrect responses; high correct response rate; and infrequent criticism of
student responses) have been well documented as effective teaching practices
in classroom settings. The findings for opportunity-to-learn variables, in
particular instructional appropriateness (operationally defined as easy
difficulty of tasks, high engagement rate, and little transition time,
indicating more instructional time), have also appeared repeatedly as
effective teaching behaviors based on a multitude of process-product research
findings.

Indicators of effective classroom management skills, specifically low 
incidence of intervention, lack of need for discipline, and efficient use of 
time, have likewise consistently appeared on lists of effective teaching 
behaviors derived from the process-product literature. The remaining
identified teaching variables pertain to intervening and responding
punitively to students. Here, the data support the common finding that
punitiveness is
negatively correlated with student performance. 

The recommendations for teacher behavior to accommodate mainstreamed
students resulting from this research would be that teachers provide positive
and encouraging feedback to students responding to questions (75 to 95
percent of the time), give sustaining feedback when students answer
incorrectly (for 60 to 80 percent of incorrect responses), ask questions
during instruction that students can answer correctly (correct response rate
of 70 to 90 percent), refrain from criticizing student responses (less than 1
percent), and provide instructional materials and learning tasks that
students appropriately engage in (80 percent engagement or better) at a low
error rate (80 percent or more correct). Furthermore, it is desirable that
teachers working with mainstreamed students provide a supportive classroom
environment in which there is little student transitional or noninstructional
wait-time (less than 8 percent) and where the need for discipline as well as
the actual intervention rates (less than six per hour) are low, and use of
punitive interventions is minimal (less than 5 percent).
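
The recommended levels above can be collected into a simple checklist. The
Python sketch below is illustrative only: the measure names, the
`check_profile` helper, and the sample observation values are assumptions
introduced here, and only the numeric thresholds come from the text.
Treating the stated ranges (for example, 75 to 95 percent) as inclusive
bounds is likewise an assumption.

```python
# Recommended levels as stated in the text. All names here are hypothetical.
# Percentages are on a 0-100 scale; interventions are counted per hour.
RECOMMENDED = {
    "positive_feedback_pct": lambda v: 75 <= v <= 95,
    "sustaining_feedback_pct": lambda v: 60 <= v <= 80,
    "correct_response_pct": lambda v: 70 <= v <= 90,
    "criticism_pct": lambda v: v < 1,
    "engagement_pct": lambda v: v >= 80,
    "task_success_pct": lambda v: v >= 80,
    "transition_time_pct": lambda v: v < 8,
    "interventions_per_hour": lambda v: v < 6,
    "punitive_intervention_pct": lambda v: v < 5,
}


def check_profile(observed):
    """Return, in sorted order, the measures outside the recommended levels."""
    return sorted(
        name for name, within in RECOMMENDED.items() if not within(observed[name])
    )


# Invented observation values for a single classroom.
sample = {
    "positive_feedback_pct": 82,
    "sustaining_feedback_pct": 55,
    "correct_response_pct": 78,
    "criticism_pct": 0.5,
    "engagement_pct": 85,
    "task_success_pct": 83,
    "transition_time_pct": 10,
    "interventions_per_hour": 4,
    "punitive_intervention_pct": 2,
}

print(check_profile(sample))  # ['sustaining_feedback_pct', 'transition_time_pct']
```

In this invented example, sustaining feedback falls below the 60 percent
floor and transition time exceeds the 8 percent ceiling, so those two
measures are flagged.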

Whereas the majority of the teaching behaviors recommended for the
benefit of mainstreamed students have general applicability for elementary
classrooms at large, three teaching behaviors in particular appear to be
required with greater frequency for special needs students: providing
positive and encouraging feedback, high success rate, and lack of criticism.
Mainstreamed students and other low-ability and/or low-SES students
apparently require greater amounts of positive and encouraging feedback 
and higher success rates, especially when working independently. An 
extremely low level of criticism is likewise beneficial for these types of 
students, whereas there is some evidence that higher-achieving students 
benefit from occasional criticism (although even here, criticism should be 
relatively infrequent). 






The results reported here suggest a tentative profile of the teacher who
is effective with mainstreamed students. The successful teacher asks factual
questions that are most often answered correctly by students. Correct
answers receive positive feedback, whereas incorrect answers receive
supportive, clarifying feedback. Criticism is hardly ever used. The effective
teacher is very responsive to students and holds high expectations for them.
In the classrooms of successful teachers, interventions are supportive,
student contact is teacher-initiated, and on-task behavior is reinforced.
Students work at their own instructional level, are appropriately engaged in
learning activities, and seldom need disciplinary interventions.

These research results indicate that the mildly handicapped child can
be accommodated by teaching practices that are beneficial to the class as a
whole, since teaching strategies that meet the needs of mainstreamed
students are also likely to be effective practices for the majority of students.

Since this research did support the position that the identified effective 
teaching behaviors would likewise have a positive outcome on the academic, 
social, and behavioral performance of mildly handicapped learners, the 
findings serve to substantiate further the generalizability of the identified set
of teaching behaviors to regular classroom settings serving children with 
mild learning handicaps. 

It is important to consider here that the notion still persists, despite 
evidence to the contrary, that an academic focus or orientation will 
necessarily operate to the detriment of affective development. Yet Medley 
(1977), in comparing pairs of relationships, reported a 72 percent overlap for
achievement gains with attitude toward school, and a 75 percent overlap for 
achievement gains with measures of self-concept. Similarly, in a later phase 
of the Beginning Teacher Evaluation Study (BTES) that included an 
ethnographic study of forty elementary classes, academically effective 
teachers were able to develop a generally positive classroom atmosphere 
while simultaneously orchestrating an efficient learning environment (e.g., 
see Berliner and Tikunoff, 1976, 1977). Indeed, the data reported here
showed that individual teaching behaviors (i.e., sustaining feedback,
off-task rate, student transition time, and incidence of intervention)
correlated with as many as three of the four outcome areas considered:
achievement, academic learning time, social status, and behavioral
adjustment. In fact, nine of the fifteen effective teacher behaviors were
associated with positive outcomes in achievement or academic learning time
in addition to desired social or behavioral outcomes.

Thus it appears that although teachers may operate somewhat
differently (that is, vary their teaching behaviors according to subject
matter, grouping structure, lesson format, or their intended outcomes), they
frequently are more or less effective across subjects (in particular, reading
and math) and produce consistent affective outcomes. This conclusion is
supported in the present data, in Medley's review, and in the early Brophy
and Evertson data (1974). Moreover, in this study, all of the teachers who
were found to be highly successful with mainstreamed students were
likewise effective with the total classroom. However, it is important to note
that the reciprocal relationship (i.e., those teachers successful with the total
classroom crossed with those experiencing success with special-needs
students) showed only 47 percent overlap. Apparently, realizing success
with the class at large is not predictive of success with the special student
even though the opposite is true.
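
The asymmetry between the two directions of overlap can be illustrated with
a small calculation. The Python sketch below is illustrative only: the
`overlap` helper and the teacher counts are hypothetical, chosen so that the
shares mirror the pattern described in the text, and it assumes that overlap
is computed as the share of a reference group that also belongs to the other
group.

```python
def overlap(group, reference):
    """Share of `reference` members that are also members of `group`."""
    return len(group & reference) / len(reference)


# Hypothetical teacher IDs; the study's actual counts are not given here.
successful_with_mainstreamed = set(range(1, 9))   # 8 teachers
successful_with_whole_class = set(range(1, 18))   # 17 teachers, including those 8

# Every teacher successful with mainstreamed students is also successful
# with the whole class.
print(overlap(successful_with_whole_class, successful_with_mainstreamed))  # 1.0

# Fewer than half of the teachers successful with the whole class are also
# successful with mainstreamed students.
print(round(overlap(successful_with_mainstreamed, successful_with_whole_class), 2))  # 0.47
```

Because the smaller group sits entirely inside the larger one, the overlap
is complete in one direction and partial in the other.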

Nonetheless, as Brophy and Good (1986) have recently pointed out, 
although it may be possible to optimize student progress along several
dimensions concurrently to a certain degree, beyond some point further
progress toward one set of goals will come at the expense of progress toward
other goals that necessarily compete within the confines of the classroom.
No doubt some tradeoffs must be faced because of the rival forces that 
operate in the typical classroom setting. 

The data reported here indicate that different sets of teaching behaviors
might be advocated depending on the prioritizing of specific educational
outcomes for mainstreamed students. The behaviors identified in this
research could be organized into clusters of teaching behaviors that are
primarily related to specific types of outcomes for mainstreamed students.
For training purposes, the following four category distinctions are offered:
(1) Classroom Management and Discipline, (2) Feedback During Instruction,
(3) Instructional Appropriateness, and (4) Supportive Environment.

Utilizing this organizational framework, each of the units incorporates
several compatible teaching behaviors and classroom practices and is
predominantly associated with a specific type of student performance. In the
Classroom Management and Discipline cluster, the specific teaching behaviors
and classroom conditions recommended are: (1) efficient use of time,
(2) low incidence of intervention, (3) minimal use of punitive interventions,
(4) lack of need for discipline, (5) infrequent student transition time, and
(6) low student off-task rate. In terms of advocated practices, teachers would
establish a classroom setting for mainstreamed students that offers a
well-organized, highly structured environment in which teacher time is
appropriately allocated with little student transitional or noninstructional
wait-time or student off-task behavior. Need for discipline as well as actual
intervention rate should be low, and use of punitive interventions minimal.
These classroom management behaviors correlated significantly with
greater academic learning time as well as reading achievement.

The second cluster, Feedback During Instruction, would include:
(1) providing positive feedback, (2) giving sustaining feedback, and (3)
avoiding criticizing student responses. Recommended classroom practices
when giving feedback to students relative to instructional content (not
behavior) would be that teachers provide positive feedback to students
responding correctly to questions and use sustaining feedback for students
answering incorrectly. For example, teachers should ask clarifying or
helping followup questions intended to give students a second chance to
respond correctly. Furthermore, teachers should refrain as much as possible
from criticizing students' incorrect responses. This cluster of teaching
behaviors was associated primarily with greater peer acceptance.

In the third cluster, Instructional Appropriateness, the recommended
teaching practices are: (1) appropriate difficulty of tasks and (2) high rate of
correct student response to teacher questions. Teachers should provide
instructional materials and tasks that students appropriately engage in at a
low error rate most of the time. Questions asked during instruction should be
geared to the target group or individual such that student rate of correct 
response is high. Both these teaching practices correlated significantly with 
greater academic learning time and with less inattentive and withdrawn 
behavior. 

The Supportive Environment cluster would include: (1) use of supportive
interventions, (2) responding supportively to low-ability students,
(3) responding supportively to learning problem behaviors, and
(4) infrequent usage of punitive response options. The recommended teaching
practices would be that teachers working with mainstreamed students
provide a supportive, nonthreatening classroom environment. Accomplishing
such an atmosphere would entail greater use of supportive behaviors and
intervention strategies directed at specific student behaviors
characteristic of special needs students. Employing these teaching strategies
correlated significantly with less inappropriate student behavior.

Thus, depending on the outcome of interest, different teaching behaviors
could be advocated. These clusters of behaviors might also be implemented
successively according to prioritized goals for individuals or groups
of mainstreamed students.

ALTERNATIVE CLASSROOM MODELS FOR 
MAINSTREAMED CLASSROOMS 

This chapter has principally discussed effective teacher behaviors within the
traditionally organized classroom. However, several groups of researchers
have developed and evaluated classroom instructional methods quite
different from traditional instructional practices, and have examined their
effects on mainstreamed as well as nonhandicapped students. This section
discusses such methods.

The Need to Reconsider Current Practice 

Several factors are currently contributing to the present reexamination of
existing policies regarding education of academically deficient students and
are providing impetus for the movement for a coordinated service delivery
system to accommodate all at-risk students. This would include students
with diagnosed learning handicaps such as the learning disabled, educable
mentally retarded, behaviorally disordered, and so on, as well as
economically deprived students currently serviced by Chapter I, students
receiving remedial instruction in basic skill areas, and all other students
falling behind in the acquisition of basic skills. The first factor is the
alarming increase in the number of school-age children currently experiencing
failure in our school systems. An even greater increase is projected for the
years to come as children from low-SES and minority backgrounds become an
increasing proportion of the school-age population. As estimated by Will
(1986), some 20 to 30 percent of the school-age population (or nearly 8
million) are having difficulty progressing in our public schools. The sheer
numbers alone argue for new strategies to enhance the educational success
of these students.

The second factor is the increasing failure of our special education
system to remediate learning problems or to impact on the cumulative deficit
phenomenon. Similarly, recent investigations of instructional practices
occurring in special education settings indicate a scarcity of quality
practices, evidenced by failure to make use of assessment data (Joyce, 1980;
McNair and Joyce, 1979; Potter and Mirkin, 1982), failure to evaluate
students' progress regularly (Fuchs, Fuchs, and Warren, 1982), failure to
allocate sufficient time to basic skill instruction (Leinhardt and Pallay,
1982), use of slow-paced instruction with few lessons completed and less
content material read (Madden and Slavin, 1983), as well as limited direct
instruction (Leinhardt, Zigmond, and Cooley, 1981). These findings lead many
educators to view special education as not, in reality, special but as
potentially inferior to regular education.

The third major factor is the expanding discrepancy between low and
normal achievers as they progress through the curriculum content. A
corollary issue is the new "great debate," or the "time/achievement/equality
debate," with evidence supporting the notion that the gap is ever increasing
with initial academic differences in learning rate (or time needed to learn)
remaining stable (see Bloom, 1984; Slavin, 1987).

Research evidence is beginning to accumulate to support the notion that
the majority of academically deficient students can achieve both academic
and social success in heterogeneous class groupings, provided effective
teaching is practiced and adaptive, individualized, and/or cooperative
learning strategies are implemented. In this section of this chapter,
instructional programs and strategies that have been employed successfully
with academically handicapped students in regular class settings will be
reviewed for the purpose of isolating specific programmatic components as
well as identifying those components corresponding to the variables
identified in the teacher effectiveness literature. Direct instruction
techniques, the Adaptive Learning Environment Model (ALEM), cooperative
learning strategies, peer tutoring, and learning strategies will be
addressed, respectively. Although each approach is considered separately, it
is important to note that these methods have overlapping components.
Furthermore, a thorough discussion is not being offered here since these
programs are discussed in other chapters of this book.

Direct Instruction



Direct instruction has recently come to have two meanings. Traditionally,
direct instruction referred to work primarily done by Engelmann and his
colleagues (e.g., Becker, Engelmann, Carnine, and Rhine, 1981); however,
more recently the term direct instruction often refers to a set of procedures
derived from the teacher effectiveness literature and delineated by
Rosenshine (1976). Rosenshine and Stevens (1984) concluded that low-achieving
students show greater academic achievement when their teachers follow a
consistent pattern of demonstration, guided practice, and feedback,
irrespective of curriculum.

Though the term generally connotes a form of highly structured,
teacher-directed instruction, one way to view direct instruction is as a
comprehensive system of instruction encompassing classroom organization
and management, quality of teacher-student interactions, and design of
curriculum materials (Gersten, Woodward, and Darch, 1986). Direct
instruction as conceived by Engelmann and colleagues is concerned with
rigorous analysis of exactly how curricular materials should be constructed.
In direct instruction programs, initial teaching involves explicit
instruction on each step in the sequence. Thus direct instruction is an
instructional program that meets the conditions as conceived by Rosenshine,
yet incorporates additional features. Design components of direct
instruction curriculum include mastery learning, carefully designed
sequential instruction, flexible skill-level grouping, adequate opportunity
for practice, and periodic review of previously learned concepts.

A number of studies have found direct instruction superior to other 
instructional approaches for academically handicapped students in reading 
comprehension skills (e.g., Patching, Kameenui, Carnine, Gersten, and
Colvin, 1983; Carnine and Kinder, 1985), as well as math and reading
achievement (e.g., Gersten, Becker, Heiry, and White, 1984). Similarly,
Stein and Goldman (1980), examining the effect of Distar Reading with 
primary-age learning-disabled students, found appreciable gains in reading 
achievement. 

The Exemplary Center for Reading Instruction (ECRI) is a highly
structured, teacher-directed, mastery learning approach, utilizing a
demonstration-prompt-practice format relying on the prior establishment of
classroom routines, and thus can also be designated as a direct instruction
approach. Key instructional features include greater allotted instructional
time, positive reinforcement, overt and rapid student response procedures
making the levels of student opportunity to respond high, and specific
procedures to maximize actual instruction time. Although the method was
developed and validated prior to the more recent effective teacher behavior
literature, the approach shares many of the instructional components
described by Rosenshine and Berliner (1978) based on summarizing this
literature base.

Data regarding the effectiveness of ECRI have shown the method to be
effective in significantly increasing student achievement for both regular and
special education students (e.g., Linn, 1980; Reid, 1986; Vaux, 1979) as well
as for compensatory education and minority students (Garner, 1975; Sloan,
1979).

Thus this approach offers individualized instruction as well as mastery 
learning while also incorporating several recommendations stemming from 
the teacher effectiveness literature, namely, increased ALT, positive rein- 
forcement and feedback, and increased opportunity for student response. 

Adaptive Learning Environment Model 

Adaptive instruction is defined as use of alternative instructional strategies
and school resources to provide effective education for all students. One
application, the Adaptive Learning Environment Model (ALEM), developed
at the University of Pittsburgh R&D Center, combines aspects of
prescriptive instruction with aspects of open education for the purpose
of basic skill acquisition (Wang, 1980). The five major components of
ALEM are: (1) a highly structured, hierarchical basic skills curriculum
integrated with exploratory learning activities; (2) an instructional/learning/ 
management system to maximize class and school resources; (3) family 
involvement via integration of school and home learning experiences; (4) 
flexible grouping, team teaching, and peer tutoring to increase the amount of 
school time available for learning and the amount of time students actually 
spend on learning; and (5) a data-based staff development program. 

Data on the use of ALEM with disadvantaged and learning- 
handicapped students have shown achievement gains in reading and math 
(though generally not significant) as well as increased engaged time and 
enhanced student self-perception of cognitive competence, social
competence, and general self-esteem (e.g., Wang and Birch, 1984; Wang and
Reynolds, 1985; Wang and Walberg, 1983).

ALEM can thus be described as a program that attempts to enhance
use of school time and resources by specifically increasing the time available
for instruction and learning. It incorporates individualized instruction,
mastery learning, peer tutoring, and teaching variables derived from the
teacher effectiveness research relating to efficient use of instructional time.

Cooperative Learning Strategies

Cooperative learning strategies primarily involve a change in the
interpersonal reward structure of the classroom from a competitive reward
structure to a cooperative one. Most research on the practical application
of cooperative learning techniques in a classroom setting has involved four
models: Teams-Games-Tournament (TGT); Student Teams-Achievement Divisions
(STAD); Jigsaw; and Small-Group Teaching (SGT) (Slavin, 1980).

TGT establishes four- to five-member student teams with instructional 
tournaments. Students are assigned to teams maximizing heterogeneity. 
STAD uses identical team membership but replaces tournaments with 
simple quizzes that students take after studying in their teams. In Jigsaw, 
team structure is the same as for TGT and STAD but academic material is 
broken into as many sections as team members. Students study the sections 
with members from other teams with the same assigned sections and then 
return to their teams and in turn teach their sections to other team members. 
Team members are tested, but individual student scores contribute to
individual grades only; therefore there is no formal group goal. SGT is a
general classroom organizational plan in which learning takes place through 
cooperative group inquiry with students selecting topics and organizing 
themselves into small groups. 

Classroom cooperative learning techniques generally include any com- 
bination of the following dimensions: reward interdependence, task inter- 
dependence, individual accountability, teacher-imposed structure, and use 
of group competition. Research results from implementation of cooperative
learning strategies show a relatively consistent pattern of
positive effects on both student achievement and self-esteem for regular
nonhandicapped students (Slavin, 1980).

In more recent work, Slavin, Madden, and Leavey (1982) combined
cooperative learning with individualized instruction to form Team-Assisted
Individualization (TAI). Focusing on outcomes for mainstreamed students in
particular, they found that TAI did positively affect the social status of
mainstreamed students as well as teacher ratings of their classroom behavior
and self-confidence. Similar positive results were not found for math
achievement in a preliminary eight-week study; however, in a longer
twenty-four-week intervention, significantly positive effects on the math
achievement of mainstreamed academically handicapped students were
found (Slavin, Madden, and Leavey, 1984). Most recently, using a combined
program of mixed-ability, cooperative work groups and skill-based reading
groups (Cooperative Integrated Reading and Composition, or CIRC),
Stevens, Madden, Slavin, and Farnish (1987) found positive effects for
mainstreamed academically handicapped students in both reading
comprehension and vocabulary.

Similarly, considerable evidence exists that cooperative learning
experiences are more effective in promoting positive relationships between
handicapped and nonhandicapped students than competitive and
individualistic ones (Johnson and Johnson, 1975, 1978, 1983, 1984, 1985;
Johnson, Johnson, and Maruyama, 1983).

Although cooperative learning experiences have generally been found
to result in higher achievement than competitive strategies for overall
student achievement, the evidence in terms of academically handicapped
students in particular is somewhat less clear (e.g., Cosden, Pearl, and
Bryan, 1985; Johnson, Johnson, and Maruyama, 1983; Talmage, Pascarella,
and Ford, 1984), with the exception of the work by Slavin and his colleagues,
which combines cooperative learning with other instructional features.



Peer Tutoring 

The results of a meta-analysis of findings from sixty-five studies of school
tutoring programs showed that peer tutoring has positive effects on the
academic performance and attitudes of those tutored (Cohen, Kulik, and
Kulik, 1982). Although both structured and unstructured tutoring produced
measurable effects, effects from structured programs were stronger, and
effects were stronger for math than for reading. In delineating specific
components required for successful peer tutoring, Jenkins and Jenkins
(1985) noted the following: highly structured and carefully prescribed
lessons; content based on the classroom curriculum and immediate teacher
goals; requiring mastery to be demonstrated; a continuous program of
moderate duration (fifteen to thirty minutes daily); and providing specific
tutor training for giving clear directions, encouraging and praising,
confirming correct responses, correcting errors in a nonpunitive fashion, and
avoiding overprompting.

Concerning handicapped students in particular, one study using trained
high school students to tutor elementary special education students who
failed to master regular education curriculum objectives found that the
tutored students made appreciable academic gains (Armstrong, Conlon,
Pierson, and Stahlbrand, 1979).

One application of peer tutoring, labeled the Classwide Peer Tutoring
System (CPD), was developed to improve instruction for minority,
disadvantaged, and/or learning-disabled students by increasing their
opportunities to respond and their active engagement in lessons (see
Delquadri, Greenwood, Whorton, Carta, and Hall, 1986). The method is
predicated on selection of skill activities for tutoring as closely aligned
as possible with teacher outcome criteria. The system also includes behavior
analysis principles for providing reinforcement for correct responding, such
as individual and group contingencies, as well as social and token
reinforcement. The program is structured to require systematic teacher review
to provide reinforcement based on student progress. Also included are
peer-mediated contingencies implemented by establishing teams that earn
points and compete for "team of the week." A third component consists of
daily and weekly posting and feedback.

Use of these techniques has been found to improve considerably the 
rate of active student responding (Elliott, Hughes, and Delquadri, 1984),
substantially reduce spelling errors for students with high prior error rates
(Hall, Delquadri, Greenwood, and Thurston, 1982; Delquadri, Greenwood,
Stretton, and Hall, 1983), and increase the number of words read correctly
for learning-disabled students (Greenwood, Delquadri, and Hall, 1984).

This approach basically increases direct teaching time while providing 
increased opportunity for active responding and a motivational system. 
Again, these components are features primarily derived from the teacher
effectiveness research. 

Learning Strategies



Although several labels have been applied to approaches that attempt to
teach students how to learn (e.g., cognitive-behavioral training,
metacognitive strategies, and, most commonly, learning strategies), all are
"learning to learn" strategies and include two levels: (1) teaching specific
learning strategies and routines, followed by (2) gradually relinquishing the
application to students.

Basically, learning strategies are techniques, principles, or rules that 
enable students to learn, solve problems, and complete tasks independently 
(Deshler and Schumaker, 1984). The ultimate goal of the approach is to 
enable students to analyze and solve novel problems while requiring 
students to accept major responsibility for their learning and progress. The 
Learning Strategies Curriculum (e.g., Deshler and Schumaker, 1986) is 
organized into three major strands corresponding to the demands of the 
secondary curriculum: (1) acquisition of information from written material, 
(2) identification and storage of important information, and (3) strategies for 
facilitating written expression. Students exposed to this approach have 
shown marked gains in reading comprehension and reduced number of 
errors in written material (e.g., Clark, Deshler, Schumaker, and Alley, 1984; 
Schumaker, Deshler, Alley, and Warner, 1983). Deshler and Schumaker 
note, however, that it is essential to provide careful staff development to 
bring about student progress since little change in student behavior has been 
noted in the absence of such training. 

Metacognitive strategies instruction, as described by Palincsar (1986), 
consists of techniques designed to teach students to plan, implement, and evaluate 
strategic approaches to learning and problem solving. Palincsar has devel- 
oped a procedure labeled reciprocal teaching, defined as "dialogue between 
teachers and students for the purpose of jointly constructing the meaning of 
text." The four activities include summarizing, question generating, clarify- 
ing, and predicting. The research on reciprocal teaching for remedial reading 
students indicates that these students make substantial progress in 
accuracy of answering comprehension questions from memory as well as on 
tests of reading comprehension in subject areas, indicating generalization to 
these settings (Brown and Palincsar, 1982; Palincsar and Brown, 1984). 

Although learning strategies approaches were originally designed for 
mildly handicapped adolescents, the developers are currently attempting to 
determine if and how students in upper elementary grades can benefit from 
these interventions to prepare them better for the transition into secondary 
school. There has been some evidence regarding the possibility of training 
strategic use of both cognitive and metacognitive processes for younger 
students. Recent studies have shown that even primary grade students are 
able to abandon ineffective strategies and use appropriate strategies more 
effectively during deliberate memory tasks as a result of being trained to 
monitor and evaluate the utility of task strategies (e.g., Ghatala, Levin, 
Pressley, and Lodico, 1985). 

Gerber (1986) has recently suggested that the tendency to dismiss 
cognitive-behavioral techniques (CBT) with younger students because of 
assumptions about their cognitive immaturity and lack of basic knowledge is 
unwarranted. He defines CBT as a means for facilitating the organization of 
previous knowledge or behaviors into more efficacious problem-solving 
routines, and advocates that training begin in the elementary grades and be 
systematically developed to parallel the basic skills curriculum. 



SUMMARY 

It appears that the approaches that attempt to incorporate several features 
that jointly impact on the learner and the teaching process, as well as 
instructional delivery, have greater promise for increasing the achievement 
of academically handicapped students. Further evidence of this reasoning 
has been offered by Bloom (1984). He contended that two or more alterable 
variables used together would contribute more to learning than any one 
alone. Considering the direct object of the intervention (i.e., the learner), the 
instructional materials, the home environment/peer group, and the teacher 
and teaching process, Bloom speculated that two variables involving differ- 
ent objects of the change process (teacher, student, materials, learning 
environment) may, in some instances, be additive, whereas two variables 
involving the same target of the change process would be less likely to be 
additive. 

For improving the academic performance of handicapped students in 
the mainstream, a three-tier structure may be necessary, incorporating not 
only change in teaching behaviors and instructional method or materials, but 
altering the learner variables as well. The first level would call for establish- 
ing a learning environment most conducive to learning for academically 
handicapped children (and in fact all students). To operate such an efficient 
learning environment, teachers would teach students expected behaviors by 
using demonstration and modeling of the expected behaviors until they 
performed the new procedures automatically. They would maintain an 
effective surveillance system during instruction, attending to all students 
within and outside the lesson. Furthermore, they would maximize learning 
time by sustaining high levels of academic responding or practice and 
keeping students engaged in learning tasks. Lastly, they would arrange their 
grouping schedule to avoid long periods of seatwork and increase the 
percentage of students involved in direct instruction. 

The second level would involve using instructional methods and 
materials that are so economical and potent that reliable gains can be 
achieved independently of individual differences (e.g., direct instruction, 
phonics instruction). These methods serve to organize information more 
efficiently than low-achieving students can on their own by increasing the 
probability of correct responding and reducing response uncertainty. 

The third level would be the teaching of specific learning strategies to 
empower students to become more efficient learners, which may be the only 
strategy with the ultimate potential of reducing the time needed to master 
important skills in an instructional sequence. 

Although this strategy basically extends Bloom's approach of varying 
two dimensions of the learning process to varying three components, it 
represents the ideal situation — which may not be feasible in practice. 
However, clearly establishing any one of the three levels is likely to 
positively affect the academic progress of mildly handicapped learners, 
whereas any combination promises to be a more powerful intervention, with 
a three-way merging, in all probability, being the most potent for enhancing 
the performance of low achievers. 

Children who are having academic learning difficulties become eligible for 
various extra-instructional support programs under Chapter I (remedial 
reading), PL 94-142 (special education services), migrant education tutorials, 
and so forth. Over the course of a day these children move back and forth 
among these programs and their regular classroom program, instructional 
programs that may or may not relate to each other in any discernible way. In 
this chapter we will focus on the coordination among these programs. We 
think this coordination is important to study for several reasons. First, stud- 
ies of remedial programs have found coordination to be a characteristic 
of more effective programs (e.g., Griswold, Cotton, and Hansen, 1986; 
Venezky and Winfield, 1977). Second, researchers who have looked for 
coordination between programs have rarely found it (e.g., Johnston, All- 
ington, and Afflerbach, 1985; Kimbrough and Hill, 1981). Third, there is a 
pervasive belief among teachers and administrators that children should have 
remedial reading programs that are entirely different from their classroom 
programs, which are obviously not working (Allington, 1986; Johnston, All- 
ington, and Afflerbach, 1985). Some believe this because of particular views 
about learners and some because they believe regulations require it. Fourth, 
a most common feature of at-risk learners is a low level of tolerance for 
variations in instructional quality (Bloom, 1976, 1984). However, the at-risk 
learner, the child who exhibits the most "cognitive confusion" (Vernon, 
1958), is the learner most likely to be confronted with an uncoordinated and 
inconsistent set of curricular approaches and instructional experiences (All- 
ington, Stuetzel, Shake, and Lamarche, 1986; Moore, Hyde, Blair, and Weitz- 
man, 1981; Kimbrough and Hill, 1981; Lee, Rowan, Allington, Anderson, 
Bossert, Harnischfeger, and Stallings, 1986). 

In this chapter, then, we discuss studies that support these findings in an 
attempt to understand why extra-instructional support programs so seldom 
evidence coordination with each other or with the regular education pro- 
grams and why coordination might be a critical feature of such programs. Our 
focus is primarily on efforts designed to promote literacy in at-risk learners, 
since, in our view, most instructional support programs were originally devel- 
oped in response to learners' difficulties in acquiring literacy skills. 

We will discuss consistency at three levels of curriculum. The first is 
that which appears in print, in state or district curriculum plans, scope and 
sequence charts, in analyses of instructional materials (such as basal reader 
series), and so on. A second level lies in the manner with which the teacher 
organizes the classroom, interacts with children, and presents the curriculum 
materials selected. The third is the curriculum as experienced by the learner. 
This is the most important level of curriculum and our discussion revolves 
primarily around the consistency of the curricular experiences of at-risk 
learners. Whereas state or district curriculum plans (or lack thereof) influ- 
ence the experienced curriculum in many subtle, and not so subtle, ways, we 
discuss those only briefly here. We focus on consistency in the experienced 
curriculum and on coordination of instructional efforts of the various person- 
nel who instruct the at-risk learner in the course of the school day. In our 
view, consistency in the experienced curriculum is unlikely without an effort 
to coordinate the instructional activities of all personnel working with a given 
child. This coordination would involve collaboration between regular educa- 
tion and instructional support personnel. Collaboration in the design and 
delivery of instruction is a necessary beginning to move toward curricular 
consistency. 



PROGRAM COORDINATION 

School Systems 

In an earlier study we found evidence of curricular consistency in few 
schools. Most often, classroom teachers and support instruction teachers had 
little knowledge of the instruction offered to an at-risk learner by the other 
(Johnston, Allington, and Afflerbach, 1985). Our analysis of their reports of 
curriculum materials used with the at-risk learner suggested that, more often 
than not, these learners were experiencing curriculum conflict. That is, the 
curricula experienced in the regular classroom reading instruction and that 
experienced in the support instruction program were dissimilar in nature and 
philosophically incompatible. The few instances of curricular consistency 
were generally found in schools where key administrators believed such 
consistency was important and where classroom and support teachers com- 
municated and collaborated with one another in planning the instructional 
intervention. Similar findings are noted by Moore and his colleagues (1981), 
Birman (1981), and Kimbrough and Hill (1981), suggesting that district-level 
administrators and districtwide policies can have substantial influence on 
whether curricular consistency, coordination of efforts, and instructional col- 
laboration will occur in schools. However, as Kimbrough and Hill (1981) 
note, few districts (or states) "appear to make serious attempts at coordi- 
nation" (p. 42). Bogdan (1983) argued that the lack of coordination was 
evidenced by a lack of clarity in who was responsible for ensuring the 
appropriateness of instruction for handicapped learners. That is, it was un- 
clear in many districts whether the special education director or building 
principal was responsible for ensuring appropriate instruction was offered in 
the mainstream classroom. Similar confusion often existed between the 
mainstream teacher and the resource room teacher. The point here is that in 
the absence of any district-level plan for achieving curricular consistency and 
instructional collaboration, one should probably not be surprised by the ab- 
sence of such consistency and collaboration in the instruction experienced by 
at-risk learners. 

School Level 

When there is little district-level activity to enhance coordination, attempts at 
coordination will fall to building-level administrators or the instructional staff 
of the core and support programs. Perhaps because of the autonomy afforded 
many school-level administrators, coordination efforts at this level, in vari- 
ous forms, seem more common than district-level plans and policies, yet 
these activities still occur infrequently (Hannifin and Barrett, 1983). 

We have argued that at-risk learners need larger quantities of high- 
quality instruction, and yet Ligon and Doss (1982), Archambault and St. 
Pierre (1980), and Kimbrough and Hill (1981) all report that participation in 
support programs often serves to replace core curriculum instruction; 
students served by various support programs actually ended up with less 
instructional time than students not served. In comparing schools with Chap- 
ter I programs to schools without such programs, others also report that 
students in Chapter I schools received less academic instructional time than 
students in non-Chapter I schools (Stanley and Greenwood, 1983). Similar 
results are reported for handicapped students attending resource room pro- 
grams from mainstream classroom placements (Haynes and Jenkins, 1986; 
Ysseldyke, Thurlow, Mecklenburg, and Graden, 1984). Thus the available 
evidence suggests that coordinating support program instruction with core 
curriculum instruction in order to provide targeted students with a consistent 
curricular approach and additional amounts of academic instructional time is 
not particularly common. Indeed, it seems that the lack of coordination often 
results in reduced services to the students. 

If coordination demands communication between the core and support 
instructional staff, the evidence is also less than encouraging. Cohen, Intilli, 
and Robbins (1978) report a survey of teachers in forty-six elementary 
schools in which they found that more than half of the classroom teachers 
reported that reading resource teachers rarely or never offered instructional 
information, suggestions, or materials. Bogdan (1983) reports that core pro- 
gram teachers and special education teachers indicated substantial confusion 
about who was responsible for instructional planning and delivery. His report 
parallels some of the situations discussed by Moore and colleagues (1981) in 
which both the support program teacher and the core program teacher 
thought the other was responsible for reading instruction. 

In an interview study conducted in ten school districts in two states, we 
found that nearly one-third of core program and support program teachers 
could not remember when they had last had either an informal or formal 
discussion of remedial students' needs, progress, or concerns (Johnston, 
Allington, and Afflerbach, 1985). In addition, there was wide variability 
between the pairs of respondents in terms of these meetings. Support teach- 
ers reported more frequent and longer contacts with students than did the 
classroom teachers with whose students they worked. When queried on 
instructional needs and goals for specific remedial students, there was little 
agreement between the responses of the classroom and remedial teachers. In 
a further assessment of communication, we asked classroom and remedial 
teachers about the curricular materials used in each instructional setting 
(regular classroom and support instruction). Two of every three support 
program teachers were unable to identify the reading instructional material 
the remedial student used in the core classroom reading context. Fewer than 
one in ten core classroom teachers could name the material that remedial 
students from their classrooms used during support program instruction. 
Similar results were obtained in interviews conducted with participating 
teachers in two observational studies (Allington and McGill-Franzen, 1989; 
Rowan, Guthrie, Lee, and Guthrie, 1986). 

Several studies have sought to assess coordination between the core 
and support instruction offered students. These studies reported observa- 
tional analyses of the consistency of instruction in classroom and remedial 
reading contexts. Allington and associates (1986) observed students enrolled 
in remedial programs, though the observations included both core curriculum 
and support program instruction. The qualitative field notes were examined 
for several types of potential instructional consistency or congruence. 
However, inconsistency of the experienced curriculum was found most fre- 
quently. The types of tasks, the specific skills emphasized, and the nature of 
the instructional materials more often differed than matched. In short, the 
instruction received by the remedial students in the two settings seemed 
generally independent of each other. This occurred even in an in-class reme- 
diation design in which support program personnel offered remediation in the 
core curriculum classroom. Similarly, Rowan and colleagues (1986) reported 
little evidence of instructional coordination between Chapter I and classroom 
instruction in schools in six states. Where coordination was observed, 
however, it was typically planned in the district- or school-level design. 

Pike (1985) reports on Chapter I programs in four elementary schools in a 
district where coordination was emphasized. This study of primary-grade 
students indicated a fairly high degree of congruence between the decoding 
skills taught in the core curriculum classroom and in the support program. 
The district provided a management checklist indicating the scope and se- 
quence for decoding skills in the adopted basal reader series and remedial 
teachers were to use this as a guide for instructional intervention. Most did, 
and although the remediation tended to focus only on this narrow band of 
reading behaviors, it was generally consistent. However, the teacher inter- 
views did not support the suggestion that communication between core and 
support staff was frequent. Coordination was less evident in some schools 
than in others, seemingly a result of support staff personality characteristics 
and professional beliefs. 

The Haynes and Jenkins (1986) study, noted earlier, indicated little 
evidence of coordination among core and support program instruction for 
elementary special education students. They also noted wide variability in 
the amount and nature of both core and support program reading instruction 
offered to students who had seemingly similar instructional needs. Licopoli 
(1984) offers one of the rare observational studies of high school support 
programs. He noted few instances of coordination and suggested that partici- 
pation in special education resource rooms offered little in the way of pre- 
paring handicapped students for participation in mainstream classes. Here, 
however, there seemed to be little agreement in the school about the funda- 
mental purpose of the resource room support program, with quite divergent 
views offered by school-level administrators, resource room support staff, 
and core curriculum instructional staff. 

The incidence of school-level coordination seems low and the level of 
coordination is quite variable in the available reports. Moore, Goertz, Hartle, 
Winslow, David, Sjogren, Turnbull, Coley, and Holland (1983) seem to char- 
acterize best the current situation in this area by noting that "program 
coordination both in the sense of deliberate steps to dovetail instructional 
efforts and in the sense of less deliberate actions to make programs somehow 
fit into the school setting almost invariably fall to those actually delivering 
services to students . . ." (p. 100). Thus, although some have pointed to the 
principal as a key factor in achieving school-level coordination (Moore et al., 
1981), more often than not such leadership seems to be lacking. In most 
instances, neither district-level nor building-level administrators seem to 
provide adequate leadership for achieving either coordination between core 
curriculum programs and support programs or among the various support 
programs delivering services to at-risk students. 

However, we need to return to the evidence that curricular coordination 
does occur, though not frequently. Too rarely did coordination appear as a 
result of general program design or district-level administrative planning. 
Often, these instances of coordination were the result of "teacher deals" 
(Biklen, 1985). That is, a support teacher (e.g., remedial reading, learning 
disability, or speech/language specialist), through his or her own efforts and 
on the strength of acknowledged professional expertise and personal style, 
managed to collaborate with some or many of the core curriculum classroom 
teachers whose children were served. As Biklen notes, this coordination is 
often unnoticed, or if noticed not recognized, by key administrators. In 
addition, we agree with Biklen when he notes that although intervention 
programs for at-risk children ultimately depend on the expertise of the teach- 
ers providing the services, programs should not be hinged on teacher effort 
alone. That is, Allington and colleagues (1986) note that much well- 
intentioned instructional effort was unleashed in the compensatory education 
classes they observed. But, at the same time, the lack of leadership at higher 
levels left the support teachers primarily to their own resources and, at times, 
denied them even that. Biklen (1985), like others, noted that support of the 
school principal was important to successful mainstreaming of handicapped 
children. Mainstreaming might ensue without active support but only on a 
limited and uneven basis. However, any resistance by the principal spelled 
doom for the effort. We think there is a parallel for the development of 
coordinated education programs for all at-risk learners. Ideally, principals 
would provide the leadership for such redesign but at the least they would not 
impede development of such efforts. 



THE EFFECTS OF SCHOOL-LEVEL 
PROGRAM COORDINATION 

School-Level Coordination 

Although program coordination at the school level is reported as uncommon, 
there does exist some evidence to indicate not only that such coordination 
can be achieved but that these efforts can enhance student achievement. For 
instance, a report by the New York State Office of Education Performance 
Review (1974) noted that schools that were considered successful had greater 
amounts of interaction between classroom teachers and support program 
teachers combined with an emphasis on instructional continuity. Likewise, 
Venezky and Winfield (1979) found that more effective schools were more 
likely to have integrated support services. In their review of effective com- 
pensatory education practices, Griswold, Cotton, and Hansen (1986) noted 
that coordination of the regular program with other support programs was 
one of the seven organizational attributes of successful Chapter I projects. 
Gezi (1986) also notes that coordination of compensatory instruction with 
core curriculum was one feature of leadership provided in successful com- 
pensatory education schools. 

Nonetheless, we have argued elsewhere (Allington and Shake, 1986; 
Johnston, Allington, and Afflerbach, 1985) that as attractive as coordination 
among programs appears, we still have little understanding of the effects of 
coordination of any of the several types discussed and similarly impoverished 
notions of how such coordination can be routinely achieved. There is some 
reason to believe that coordination in terms of more appropriate scheduling of 
services could result in greater amounts of basic skill instruction being de- 
livered to target students. Whereas expanding the quantity of instruction 
seems likely to affect achievement favorably (Cooley and Leinhardt, 1980; 
Greenwood, Delquadri, Stanley, Sasso, Whorton, and Schulte, 1981; Kie- 
sling, 1978) and would rectify the current situation reported in schools in 
which support program participation seems to decrease quantity of instruc- 
tion, such efforts seem minimal and still insufficient. In a similar vein, we 
suppose that coordination that attempts to link the content of instruction will 
result in enhanced achievement. Here, however, we have few reports to 
buttress the argument. Winfield (1987) assessed the achievement of Chapter I 
participants on reading skills that were emphasized by both the Chapter I 
teachers and the classroom teachers against those skills emphasized by one 
set of teachers or the other. She reports that the dual emphasis skills were 
more likely to be mastered than the single emphasis skills, suggesting that the 
coordinated effort produced better learning and retention. However, in her 
data it was only lower-level reading skills (simple decoding tasks) that re- 
ceived the dual emphasis, and the evidence suggests that these skills are 
relatively easier to master than some of the more complex skills that were not 
emphasized (e.g., drawing inferences from text). Thus, although supportive, 
these data cannot provide an adequate base for recommending curricular 
coordination. This is not to argue that achieving curricular coordination has 
little to recommend it, but simply that we currently lack any substantial 
empirical evidence. Indeed, later we will argue on theoretical grounds that 
curricular coordination is critical. 



Federal, State, District, and Local Policies 

Currently there is much opinion but little hard evidence to support any 
conclusion about the effects of policies developed at any level, except that 
policymaking begets policymaking. Our analysis of the various reports indi- 
cates that two views exist about the effects of federal policies concerning the 
development and delivery of support program instruction. In one view, fed- 
eral policies are the root of all of our problems. Moore and associates (1983) 
argued that the federal goal of isolating federal monies from state and local 
funds contributed to the administrative separation of the various support 
programs and that the full federal funding of Chapter I produced only minimal 
state bases for support of compensatory education. Cohen (1982) suggests 
that "specialization at the lower level parallels developments higher up; it 
also impedes coordination of activities within schools and districts . . ." (p. 
481). Ginsburg and Turnbull (1981) echo this view with their statement that 
"federal fiscal controls have also unintentionally encouraged schools to iso- 
late federal programs from each other and from the regular school program" 
(p. 36). This view is also offered by Kaestle and Smith (1982). 

The common theme in these analyses is that much, if not most, of the 
current fragmentation can be attributed to the rather patchwork development 
of federal policies concerning the various laws, regulations, and programs 
intended to guarantee or enhance the education of target populations. This 
fragmented development has been coupled with continuously shifting regula- 
tions within programs and shifting interpretations of the same regulations 
(NIE, 1977). On the other hand, the analysis by Moore and colleagues (1981) of 
implementation at the local level led them to conclude that state and federal 
regulations had slight impact in the decisions made by district- or building- 
level personnel. In addition, they note that they found little evidence that any 
comparable level of effort to meet the needs of at-risk students would have 
existed in the absence of federal policies. Similarly, McLaughlin (1982) notes 
that "federal policies cannot, by themselves, cause particular outcomes; they 
must be implemented within and through existing institutional arrange- 
ments . . ." (p. 567). 

As a result of our analysis of the various reports, we have adopted what 
might be considered an interactionist perspective. Like Leinhardt, Bickel, 
and Pallay (1982) and Stainback and Stainback (1984), we see much of the 
existing federal policy structure as unnecessarily duplicative, unwieldy, and 
influential in producing the fragmentation so often noted. However, the 
development of federal policy has often followed shifting social beliefs about 
the nature of school failure and responded to political pressure brought to 
bear by different interest groups (Johnston, McGill-Franzen, and Allington, 
1985). Federal policy has as often reflected professional wisdom and social 
beliefs as it has nurtured or refined them. For example, the development of 
the pullout model, which has been so widely adopted in compensatory and 
special education, seemed to follow the professional wisdom of a previous era 
as much as a desire of the federal agencies to develop clear audit trails. The 
small-group clinical model adopted by Chapter I teachers and special educa- 
tion teachers mirrored the clinical model so prevalent in university-based 
clinics where these support teachers received specialized training. The use of 
separate and different curricular materials to meet the presumed special 
instructional needs of special populations is likewise part of the professional 
wisdom (Cook and Earlly, 1979; Dechant, 1981; Gilliland, 1974). An example 
is the "differential teaching" model that has dominated compensatory and 
special education training for so long (Johnston, Allington, and Afflerbach, 
1985), and also fits very nicely into some misinterpretations of the "sup- 
plement, not supplant" provision of Chapter I. 

It seems undoubtedly true that the pullout model of instructional deliv- 
ery has contributed to the fragmentation of the instruction received by at-risk 
students. However, it seems unlikely that federal policies can be wholly 
blamed for the institutionalization of the model. On the other hand, local 
administrators who prefer a safe bet in anticipation of a compliance audit 
might be edged away from other designs by the widespread understanding 
that federal policies have cast the pullout design in a favorable light. Like- 
wise, state and district administrative organizational structures may have 
been refined by federal policies, but rarely does it seem that they were 
created by such policies. We feel we can argue this simply because of the 
wide variability in existing administrative structure one can currently find. 
Conventional wisdom and special interest influences largely drove federal 
policy and thus the patchwork quality of much of the federal effort (McGill- 
Franzen, 1987). But without federal programs and policies, many of the 
current efforts to alleviate school failure in at-risk children would be nonexis- 
tent (Moore et al., 1981). 



ASSUMPTIONS AND THEIR CONSEQUENCES 

A number of assumptions seem to have guided federal policymaking in the 
past twenty years, the era when the federal role expanded significantly. 
These assumptions were generally not the result of the federal policies but 
rather preceded them. The first, and probably most critical, assumption is 
that there exist several identifiable categories of at-risk children. Implicit in 
this assumption are the notions that (1) children in these several categories
have different instructional needs, (2) these categories are reliably different
from one another, and (3) we can identify children that fit in each category.
Unfortunately, as has been argued elsewhere (Algozzine and Ysseldyke,
1983; Johnston, McGill-Franzen, and Allington, 1985; Leinhardt, Bickel, and
Pallay, 1982; Stainback and Stainback, 1984), there is little theoretical or
empirical evidence to support any of these assumptions. These assumptions 
were not created by the federal policymaking but rather made their way into 
policy that has maintained and strengthened them. 

The influences that have led to the fragmentation of support instruction 
services have fed and nurtured each other. As educators, sociologists,
psychologists, economists, and others have created their various rationales for
explaining school failure, a variety of beliefs about the etiology of school
failure have arisen and have affected federal policy. In her analysis of
explanations for school failure, McGill-Franzen (1987) traces the evolution of
economic disadvantagedness as a primary rationale and the resultant
development of the Federal Title I programs designed to compensate for such
disadvantagedness. The number of publications in professional journals with
both reading and disadvantagedness as key descriptors rose steadily for
more than a decade but then gave way to another developing belief system.

This shift was evidenced in the rise in the number of professional journal 
publications using the key descriptors "reading and learning disability." In 
the past decade the number of such articles has risen tenfold and ten years ago 
replaced the previous standard of "reading and disadvantagedness." In fact, 
articles on reading failure are now more frequent in journals published for
special education and learning disabilities professionals than in journals
published for reading professionals (McGill-Franzen, 1987). These trends in
publishing seem to mirror the categorical identification of students and the
expenditure of funds for providing support instruction. That is, the recent rise
in the numbers of children identified as learning disabled, and the related rise
in expenditures for the services provided, parallels the increase in
publication with a brief time lag. However, there is little evidence that the
passage of PL 94-142 created this trend, since the number of articles on
reading and learning disability had reached significant proportions before the
enactment of the legislation. On the other hand, the legitimization of the
category through the development of federal policy seems to have had an
enormous effect. Prior to inclusion of learning disability as a fiscally
reimbursable category, relatively few school districts had any significant number
of students so classified. Now the situation has changed dramatically
(McGill-Franzen, 1987; Ysseldyke and Algozzine, 1982).

Even though the number of children living at the "poverty level" has 
risen above earlier levels, the number of disadvantaged children served by 
Chapter I programs continues to decline (McGill-Franzen, 1987). We see
the trends as reflections of shifting social beliefs about reading failure,
beliefs not born in federal policymaking but nurtured by it. To reiterate our
earlier point, however, there is little evidence to support the notion that 
children identified as learning disabled exhibit any substantial psychometric 
or educational differences from economically disadvantaged children who 
experience school failure (Algozzine and Ysseldyke, 1983). Rather than
new populations with newly discovered etiological bases for reading failure, 
what we have is shifting or developing social belief systems that attribute 
reading failure to an ever increasing array of etiologies — etiologies most 
commonly located in the learner. A common feature of these developing 
social beliefs and resultant intervention programs is the lack of concern for 
the possible deficiencies in the original educational environment in which the 
deficits were nurtured (Bogdan, 1983; Gelzheiser, 1987; Stainback and
Stainback, 1984). When children fail to learn to read competently after some
schooling we rush to identify them categorically with one program or another,
but regardless of categorical identification, there seems little evidence of
concern for adapting or improving the original educational program from
whence they came. This "fix-it" mentality works against collaborative
efforts to remedy reading failure because it presumes the etiology is centered in
a deficit in the learner, not in the educational program. Although it may be
less distressing to the educational system to blame the victim, it is unlikely
that wholehearted efforts for coordinated instructional planning will ensue
until this belief system is modified.

Although not as popular in the current debates, the same could be said 
of differences between migrant populations and either learning disabled or 
other economically disadvantaged nonmigrant children, but there is a separate
federal program for that population also. The key qualities of effective
support programs seem generally similar regardless of student classification
(Leinhardt and Pallay, 1982). Federal policy, then, has influenced the design
of state and local programs by providing the various categorical programs,
but the policies seem to have mirrored emerging social belief systems. The 
development of the policies strengthened these social beliefs, and categorical 
separation became even more evident and entrenched. 

The problem has been compounded by the interpretation, or
misinterpretation, of federal policy at the local level. The most obvious example
is the "supplement, not supplant" provision of Chapter 1. There are numerous
references in the literature to the misinterpretation of this regulation,
commonly suggesting that the misinterpretation led to the widespread use of
the pullout model for delivery of compensatory instruction (NIE, 1977;
Shulman, 1983). As Allington (1986) reports, this aspect of the regulation still
seems widely misunderstood, since more than one of three compensatory
education administrators surveyed viewed the provision as disallowing both
in-class designs and the shared use of curricular materials in both settings.
Nonetheless, we noted earlier that the pullout design also mirrored the most
common organization for remedial services delivered in the university-based
reading clinics that trained the reading specialists who organized the remedial
services in schools. Thus the small group, in a small room, with a specially
trained teacher, using specialized curricular materials and methods fit well
into both the professional belief system and the federal policy. This design is
being perpetuated in programs for the learning disabled even though no
similar federal policy exists (Junkala and Mooney, 1986).

Federal policy under PL 94-142 requires placement of handicapped 
learners in the least restrictive environment as a matter of course. However,
the design of support services for the learning disabled leans heavily toward
pullout from the least restrictive environment for services delivered in
separate resource rooms. Undoubtedly, there are a variety of reasons why the
pullout design has evolved as the most popular, and some of these are related
to federal policies, but other reasons also exist (Bogdan, 1983; Lortie,
1976; Milofsky, 1976). We do not wish to argue that in-class programs are
inherently superior (they have their own set of problems), but congruence
between instructional programs may be less likely in pullout programs.

Regulations that presume categorical identification and identifiable
instructional needs based upon the categorical label will obviously shape the
policies made at lower levels, especially if fiscal incentives are present
(McGill-Franzen, 1987). However, as Leinhardt, Bickel, and Pallay (1982)
argue, the established professional beliefs and alliances sustain and refine the
policies made at the service delivery level. The separate federal programs
administered from different federal offices often present a maze of conflicting
information. This maze is often further complicated by state laws and
regulations and state-level interpretations of federal policies. By the time the
bundle reaches the local district, much less the local school building, the task of
coordinating may seem unmanageable (McLaughlin, 1982; McLaughlin,
Shields, and Rezabek, 1985). Although more attention to the coordination of
support services at the federal level would certainly help, that alone will
not ensure that services at the local level provide coordinated and coherent
instructional environments for the children served. Our pessimism should not
deter any attempts at clarifying federal policies, but we believe that it will
take a major reconceptualization of both reading failure and support program
design to undo what has already been so unfortunately achieved.



Instructional Congruence. We have reviewed a number of studies
that have stressed the importance of coordination between instructional
programs. Few of these reports have described the features of such
programs. Some have discussed the coordination in terms of the congruence
between the curricula in the two contexts (Allington and Shake, 1986;
Johnston, Allington, and Afflerbach, 1985). However, exactly what it means for
instruction to be congruent has not been made clear. For example, we cannot
see a remedial reading teacher bound to replicate a classroom reading
program that is narrowly defined or in other ways unsatisfactory. Also, we would
not like to see a basal reader approach in a classroom program prevent the use
of other materials, such as trade books or predictable language texts, in the
support program. Thus we need to clarify the notion of instructional
coordination so that it is not interpreted simply as "more of the same."

Perhaps we should explain our notion of what would be appropriate
before addressing what we feel would be inappropriate. We view literacy
development in terms of broad concepts about communication and social and 
personal development. Within these higher-level concepts lies a substantial 
variety of conceptual and procedural knowledge that individuals integrate in 
the pursuit of various goals in various circumstances. The individual skills,
unorganized and unintegrated, are of limited help. Similarly, they are of little
help if they are not automated or if they are used inflexibly. Optimally,
children will develop within a balanced language arts program. By balanced, 
we mean a program that emphasizes a variety of means of controlling the 
difficulty of text, such as predictable language, familiar content, prereading, 
rereading, word frequency, meaning unit repetition, interest level, recency, 
and so on, integrated with a writing program. Through a careful balance of 
such means of controlling difficulty, children can develop their weaknesses 
while being allowed success through the support of their strengths. For 
example, in the language experience side of a program a child is supported by 
his or her personal knowledge, syntax, vocabulary, and the personalized 
nature of the text, while he or she works on developing an understanding of 
graphophonic relationships through writing. Patterned language reading ma- 
terials, on the other hand, support the child through the predictability of the 
language, while introducing new vocabulary, new syntax, and new
experiences. It is not difficult to tell whether children are involved in a
literacy-related task. However, it is hard for researchers or administrators who do not
understand children's literacy learning to distinguish a balanced literacy 
program from an uncoordinated one and a coordinated one from a rigid, 
unbalanced program. 

There are several major elements in this coordination issue. First, there
is congruence between curricula: what is to be taught, in what order, and
using which materials. Second, there is the method of instruction: the actual
techniques that teachers use to help children learn the curriculum. Conflicts
arise in the following sets of circumstances, which we have encountered all
too frequently in school settings (Johnston, Allington, and Afflerbach, 1985):

1. The strategies used to control text difficulty in the materials in the two
settings are radically different, as when word frequency and natural language
rule in the one setting but extreme orthographic regularity reigns entirely in
the other setting. In such a situation, the reading strategies that are learned
and practiced in one setting will be extremely inefficient in the other situation.
One might see such conflict in a school where at-risk learners receive reading
instruction from a core curriculum based on the Scott Foresman Reading
Systems (1971), an approach emphasizing predictable and natural language
text, and from a support program (offered in remedial or special education
resource rooms) that uses the DISTAR curriculum (Engelmann and Bruner,
1975).

2. The assumed hierarchy of learning in the two settings conflicts, such as
when one setting focuses strongly on comprehension yet the other setting
emphasizes decoding as the essence of reading. In one case, the development
of prediction will be encouraged, whereas in the other setting it will be
discouraged. One might see such a conflict in a school where the
Houghton-Mifflin (Durr, 1981) basal is the source of core curriculum reading instruction
and the Merrill Linguistic Readers (Wilson, 1980) is the curriculum of support
programs.

3. The strategies to be learned differ from one setting to the other. For 
example, long division can be taught in terms of borrowing or through con- 
ceptually quite different procedures. 

4. Instructional strategies differ radically in the two settings, as when one 
setting depends entirely on teacher-directed instruction and the other setting 
depends on learner-directed instruction. In such a case the latter setting will 
stress self-monitoring and self-correction, whereas the former will prevent 
them. 

5. The terminology and metaphors differ in the two settings, causing the
child to become confused about some of the concepts. This happens, for instance,
when use of a word family (e.g., ake, ame) approach to decoding is coupled with a
synthetic phonics approach (or when word families of the classroom become
phonetic keys in the support setting).

If the two parts of a program could easily be seen operating within a
coherent classroom program, then they are likely to be more or less
congruent. There is the risk that with widely divergent approaches the children
will be unable to integrate what they are being taught and will develop
confused notions of the nature and purpose of reading. "Cognitive
confusion" was the term Vernon (1958) used to describe the problem, and some of
its consequences have been documented by Johnston (1985). Thus if two
parts of a program are necessarily separate, then some part of the program
needs to be devoted to making clear how the parts fit together. Otherwise, the
integration, the tough bit, is left up to the children. It also seems likely that
conflicts stemming from programmatic differences will be intensified by
factors such as anxiety, which are often quite severe for failing learners. It is
probably more efficient to ensure coordination from the outset.

We feel that the major aspect that affects these dimensions of instruc- 
tion is the teacher's expertise. If we can consider both classroom and support 
teachers as experts, then presumably the support teacher is more expert than 
the classroom teacher in a specific area such as reading or mathematics 
curriculum and instruction. Of course, there are exceptions to this, and too
often poorly trained teachers' aides provide the remedial instruction. Experts
differ from novices in three major ways. They differ in terms of the extent of
their knowledge, the structure of their knowledge, and the flexible use of their
knowledge (Chi, Glaser, and Rees, 1982). It is their knowledge of teaching of
a specific domain like reading, and how children learn, that makes them 
choose materials and techniques (to the extent that they perceive that they 
have choice in each area). The structure of teachers' knowledge becomes 
apparent when they teach and when they choose materials, and the
differences can be substantial (DeFord, 1985). Unfortunately, these ideological
differences are most pronounced at the beginning stages of reading and, as
Carter (1984) and Clay (1982, 1985) have pointed out, the earlier remedial
programs begin, the greater the likelihood of success. Coordination is most
critical where it is least likely.

In some sense, then, there are essentially three curricula involved. One
is the explicit curriculum formally expressed by the school curriculum. The 
second is the implicit curriculum held in the knowledge structures of the 
teachers and expressed in their interactions with the children. The third 
curriculum is that experienced by the child. Program planning decisions can 
be made on the basis of the explicit curriculum, but the moment-to-moment
decisions must be automatic or intuitive, and will be made on the basis of the
implicit curriculum. Neither of these describes the curriculum actually
experienced by a given child. When the coordination issue is viewed this way, the
clarity and coherence of the overall school curriculum become important.
There should be a clear, explicit reading curriculum, including the
philosophical underpinnings of the curriculum, and it should encompass descriptions of
how instruction for students with special needs should differ from that of
regular students, if it is indeed felt that it should differ in some way.
However, ongoing discussions and observations between teachers in the
context of day-to-day instruction are more likely to induce congruence
between the written and expressed curricula. Only close monitoring of the
child's development will provide information on the child's experienced 
curriculum. 

If a school seriously adopts a basal series, there is likely to be some
coherence in the overall reading instruction provided. Indeed, some evidence
suggests that effective schools are more likely to have a schoolwide adoption
than are less effective schools (Clark and McCarthy, 1983). This may be
because the basal curriculum, while constraining decisions, does provide a
more coherent schoolwide curriculum than an "every teacher for
himself/herself" situation in which no common basal or other unified curriculum
exists. However, a unified curriculum, whether or not achieved through a
basal series, does not ensure compatibility of the curricula implicit in the
teachers' knowledge structures, which is a much more difficult congruence to
attain and infinitely more important. For example, with respect to reading
instruction, the training undertaken by teachers preparing to serve
learning-disabled students is generally substantially different from that undertaken by
teachers training specifically to serve students with special needs in reading.
This difference is ideological and influences how students' behaviors are
interpreted and how they are seen as different from other students (Ysseldyke
and Algozzine, 1982).

The effects of teachers' ideological differences might be minimized
through a clear curriculum, cooperative planning of that curriculum,
coordinated inservice programs, and constant opportunity for and encouragement
to engage in dialogue. Many schools' curricula are defined by the basal series
adopted, and serious schoolwide adoption constrains some dimensions of
classroom instruction. For example, each basal series opts for particular
forms of text difficulty control. It would not be easy for a teacher to stress
prediction and self-monitoring if the basal program selected and enforced
were the Merrill Linguistic Series, for which unpredictable text has
deliberately been constructed, difficulty control being through restriction of the
letter-sound relationships. In the same way, a developmental reading
program based on reading trade books stressing self-monitoring and
self-correction in the classroom would seriously conflict with a direct instruction
program such as DISTAR (Engelmann and Bruner, 1975) in the support
program, which stresses immediate feedback by the teacher where errors
occur.

To illustrate, we provide samples of beginning reading texts from
commercial programs with quite different forms of text difficulty controls
(see Figure 11-1). "The Bus Ride" excerpt is an example of a patterned
language text where the repetition and common topic all serve to ease the
difficulty for the beginning reader (as do the rather limited vocabulary and
accompanying artwork). The code-emphasis excerpts, on the other hand,
attempt to control text difficulty by restricting the range of word families
encountered, but in so doing create text with unnatural syntax and rather
nonsensical form (and no traditional story structure). Children experiencing
difficulty in either curricular material may well be further confounded when
presented with both. The appropriate strategies for reading one text are
simply quite divergent from those for the other.

To return to the question of balance, then, an expert support teacher 
might observe the classroom teacher's reading program and decide that it is 
a balanced program, but that within the program, a certain student requires 
special attention in a specific area. Such instruction would include the 
broader framework particularly by highlighting how the focal points fitted 
into the broader program. For example, the student may require additional 
attention to hearing sequences of sounds in words. This can be difficult for 
some children to pick up in a group setting. The support teacher could work 
on that in the context of writing and/or reading real stories. On the other
hand, observation of the classroom instruction might reveal an unbalanced 
program. For example, suppose a classroom teacher's program consisted 
almost entirely of completing skill sheets emphasizing phonic analysis. 
Difficulty control in such a program is accomplished by constraining the
letter-sound relationships, and success is defined by accurate pronunciation 
of words in isolation. A support teacher could decide to work with the 
classroom teacher or try to shape up the balance independently. The former 
involves extensive communication, which currently seems lacking. The 
latter requires not simply teaching the untaught skills, but teaching in such a 
way that the instruction ensures the integration of skills learned in the
regular program with those learned in the remedial program. Again, this
requires knowledge of the other program, which implies some form of
communication.

FIGURE 11-1. Examples of Reading Material Using Different Approaches for
Controlling Text Difficulty

The fox is on a log.
The fox is by the sty.
Can Red get Pub?
Yes, the fox has the pig by a leg . . .

Source: Excerpted from The Fox and the Cat, Behavioral Research Labs, Palo Alto, CA.

The man had the log on sand and the log got hot. The log had
tag fog. The log got hot and the cod got hot . . .

Source: Excerpted from P. J. McInnis and S. K. Curtis (1982). The Cursive Approach to
Readiness and Reading. Hammondsport, NY: M/C Publications.

The Bus Ride

A girl got on the bus.
Then the bus went fast.
A boy got on the bus.
Then the bus went fast.
A fox got on the bus.
Then the bus went fast.
A hippopotamus got on the bus.
Then the bus went fast . . .
A bee got on the bus.
Then . . .
The hippopotamus got off the bus.
The fox got off the bus.
The boy got off the bus.
The girl got off the bus.
Then they all ran fast.

Source: Reprinted with permission of Scott, Foresman Co.

The important issue here is that the individual responsible for program 
coordination must have extensive knowledge of how readers develop and of 
frameworks of instruction. It is not possible to attend to the relationship 
between the learning in two programs if one is aware only of the activity in 
one of the programs, as is frequently the case (Johnston, Allington, and
Afflerbach, 1985). There needs to be extensive investigation of ways in
which cooperative instructional programs can be developed. This raises the
issues of communication and awareness of other teachers' approaches, as
well as that of compatibility of knowledge structures. It seems likely that the
more congruent teachers' knowledge structures are, the less time is likely to
be involved in communicating and coordinating programs, and the more
congruent the programs are likely to be in the first place. Similarly, when
two teachers have an extensive knowledge of how the other teaches, even if
their knowledge structures differ, they are more likely to be able to build
bridges between each other's programs.

Current "cellular organization" (Lortie, 1976) of classrooms tends to 
isolate teachers from one another so that they do not acquire knowledge of 
each other's teaching approaches. The use of in-class approaches for support
program instruction may foster a greater knowledge of program compati- 
bility, but because of ego defense may actually reduce the likelihood of 
cooperation unless the arrangement is self-induced. Various contextual fac- 
tors seem likely to influence the probability of teachers cooperating. For 
example, competitive organizational structures such as those set up in some 
career ladder systems seem less conducive to coordination than do coopera- 
tive organizational structures (Ames and Ames, 1984). 

Pugach and Johnson (1987) have shown that training teachers to be 
"peer collaborators" can be an effective way to solve the instructional 
problems of students with mild learning and behavior problems. Their ap- 
proach has involved helping teachers to structure and focus their conferences 
around particular children or groups of children. The dialogues are initiated 
by a teacher who has an instructional problem to solve and feels he or she 
needs support. The success of such a program must surely rest on the 
non-ego-threatening response of the cooperating teacher. These teachers are 
trained to respond to the initiator in a similar way to the response of the writing
teacher in a process writing class. Perhaps the facts that such meetings are
expected and the timing is intended to be preventive (suggesting that the 
problem is not yet serious) help to allow the teachers to seek help. However, 
the collaborations they report were between regular classroom teachers. 
Whether or not they could operate between a regular classroom teacher and a 
specialist teacher without the differential power status fouling it up is an 
unanswered question. 

Opportunity to Learn. Although support for the notion of differential
instruction for differential categories of students has been seriously
undermined (Marston, 1987; Reynolds, Wang, and Walberg, 1987; Stainback and
Stainback, 1984), the residual effects of the approach are still evident in the
training of many specialists and teachers, and it remains implicit in the
structure of many federal and state policies. In some respects this situation
almost guarantees a lack of congruence between instructional programs. We
believe that there is reasonable evidence that this situation deserves to be
remedied. On the basis of the lack of support for differential instruction, and
the rather extensive evidence that sheer opportunity to learn is a powerful
explanatory variable (Leinhardt and Pallay, 1982), we think it would be better
to begin planning support instruction with the assumption that the difference
between the at-risk and other students resides largely in the need for
differential opportunities to learn (Good, 1983; Crawford, Kimball, and Patrick, 1984;
Hiebert, 1983; Kiesling, 1978). Currently, participation in instructional
support programs seems more often to reduce the at-risk learner's exposure to
instruction (Allington, 1986; Haynes and Jenkins, 1986; Zigmond,
Vallecorsa, and Leinhardt, 1980). Simply put, we argue that some learners will, at
times, require larger amounts of higher-quality literacy instruction in order to
facilitate progress in literacy development. The instructional support we
propose for at-risk learners would not, then, necessarily involve any
alternative curricular approach, but would, instead, provide at-risk learners with
access to more, and better, instruction.

Acceptance of this alternative assumption provides a reason to
coordinate instruction, so that program effects are cumulative rather than
fragmented. Whereas coordinated instructional efforts would enhance learning,
we argue that at-risk learners will often need increased opportunities to learn.
Although we are advocating the position that the at-risk learner needs more
time to learn, we do not advocate this in a permanent, trait-like way. With 
Clay (1985), we argue that with optimal instruction and more of it in the early 
stages, these students will become able to learn as fast as the other students. 
Thus the extra-instructional time requirement will disappear ultimately. 

Supplement, Not Supplant. A major provision of Chapter I has been 
that remedial assistance should supplement and not supplant the regular 
instructional efforts (unfortunately, this has not been a tenet of special educa- 
tion services). The intent was to prevent schools from simply using federal 
money to fund existing instructional efforts, but interpretations of this regula- 
tory language have produced aberrant instructional situations. For example, 
some believe that it prohibits in-class programs and others believe that it 
insists on different instructional methods and materials (Allington, 1986; 
Johnston, Allington, and Afflerbach, 1985; NIE, 1977). However, the basic 
problem is that within a given school day, unless the school day is extended
for at-risk children, something must be supplanted in order to provide
remedial support. Thus the notion of supplementation needs to be examined in
terms of the alternatives open to school systems. 

One approach to this is to stagger the school day to allow some students 
to arrive and leave before others. This allows for reduced pupil-teacher ratios 
during parts of the day. Such an approach has been used successfully in
Sweden (Clay, 1985). Similarly, it has been shown that using Chapter I 
money to reduce substantially the pupil-teacher ratio across the board can 
produce gains for the low-achieving students (Doss and Holly, 1982), pri-
marily by increasing the amount of teacher-student contact. However, stag-
gering to reduce pupil-teacher ratio does not, in itself, result in consistent
achievement gains (Balow, 1969; Filby, Barnett, and Bossert, 1983; Cahen,
Filby, McCutcheon, and Kyle, 1983). Leinhardt (1980) notes some of the
potential problems with this approach. First, smaller, more homogeneous
classes may result in classroom teachers having lowered expectations for
achievement. In her study of transition rooms, teachers with small classes
often offered less reading instruction than regular classroom teachers, a
result unlikely to enhance children's reading achievement. Second, unless
the quality of instruction is improved, there would be little reason to predict 
greater achievement gains even if the opportunity to learn remained constant. 
In short, any strategy that has little positive effect on the quantity or quality of 
instruction offered, or both, seems unlikely to enhance achievement. For 
teachers who know what they are doing, and who increase the opportunity to
learn, this seems to be an option well worth exploring. We are not yet
convinced that this approach will make a difference for teachers with limited 
expertise, however. The teacher with limited expertise and thirty pupils still 
has limited expertise when assigned only fifteen pupils. 

Within a fixed-length school day, it seems possible to supplant regular 
instruction somewhat, provided the displacement is of such high quality that 
it is relatively short-lived. The critical features have to do with the fact that 
these students must learn substantially faster in this situation than the regular 
classroom program. The faster they learn, the briefer the supplantation. The 
conditions that produce this accelerated learning seem to include efficient, 
effective instruction focusing on independence, low pupil-teacher ratio (par-
ticularly one-on-one), and early intervention (Carter, 1984; Clay, 1985). For
example, under these conditions, Clay (1982, 1985) and Pinnell (1987) have
shown that for the children making the least progress in literacy development,
one-on-one instruction for thirty minutes per day, four days a week, can bring
most of them to levels comparable to the average group of students within six
to twenty weeks. In addition, by and large, these students have needed no
further support throughout elementary school. It is important to note that this
approach took place within a context of highly compatible teacher knowl-
edge structures, extensive cooperative arrangements between teachers to
release one another for parts of the school day, and considerable administra-
tive support. However, with respect to the present discussion, the important
point is that if regular instruction, in whole or in part, is temporarily and
partially replaced with highly effective individual instruction, the problem of
congruence is substantially reduced, except for the problem of reintegration
into the regular curriculum, which is, of course, the goal of the program and
remains critical. The reintegration means, first, that instruction must be
directed toward independence. This is most critical because the effects of
nonindependence-focused instruction (highly teacher directed) will leave the
child without flexible problem-solving strategies and more at the mercy of
exactly the strategies that are taught. This will be most problematic in the
classroom in independent work and will likely result in backsliding. Second,
it means that the instruction should emphasize, and allocate most time to, the 
reading of connected text, which is what will generally be required in the 
regular classroom. 

Currently, the likelihood of such intensive, preventive approaches be- 
ing implemented seems minimal. The notion of prevention appears to require 
resource investment where there is not yet (and may never be) a problem; in
times of shrinking resources such as the present, this does not seem likely to
gain favor. Although individualized instruction has received bad press in
some of the literature on the grounds that small group instruction is more
efficient (e.g., see Rosenshine and Stevens, 1984), it appears that it may be
inappropriate to generalize from classroom studies. The work of Bloom
(1984), Clay (1982, 1985), and Pinnell (1987) strongly favors early, intensive,
individualized one-on-one tutoring, and the Chapter I evaluation work of
Carter (1984), Crawford, Kimball, and Patrick (1984), and Guthrie, Seifert,
and Kline (1978) provides additional support for low pupil-teacher ratios,
preferably one-on-one.

Unfortunately, such early intervention is not encouraged by current 
assessment practices (Johnston, 1984) or by funding policies and practices, 
but such intervention has some clear advantages. Given approximately
equivalent amounts of instruction, less able readers normally progress more 
slowly than do other students. Each year they fall increasingly further be- 
hind. This means that even with accelerated learning, later intervention will 
require longer intervention in order for students to catch up. In addition, the 
secondary characteristics that such students develop as a consequence of 
continued failure extend even further the necessary length of the intervention 
program (Johnston, 1985; Johnston and Winograd, 1985). Associated with 
this, support teachers currently have to deal with up to ten children at once, 
and because of scheduling problems, frequently the children range over 
several grades in performance level. The size and heterogeneity of the in-
structional groups scheduled for support teachers make faster learning im-
possible. Each of these factors simply compounds and prolongs the coordi-
nation problem; thus we suggest that along with consideration of program
coordination, additional assumptions that impact on the length and likelihood
of coordination should be reexamined.



CASE STUDIES 

How School Districts Have Improved Coordination 

The development of a well-coordinated approach to literacy instruction for 
at-risk students involves far more than an administrative mandate that 
instruction will be coordinated. We have observed such attempts. In one
case, in-class support program instruction was simply mandated districtwide.
Little thought had evidently gone into this mandate since a previous mandate 
concerning use of separate and philosophically distinct curricula in the 
classroom and support program instruction was left intact. Thus there was 
literally nothing for teachers to plan collaboratively— each was supposed to 
follow a different packaged curriculum. It is perhaps not surprising, then, that 
we found little communication or coordination between classroom and sup- 
port teachers, even though they taught in the same classroom. Likewise, it is 
not surprising that teacher resentment was high and efforts to sabotage or 
simply ignore the mandate were frequent. 

Similarly, another district simply mandated daily teaching logs be ex- 
changed between classroom and support teachers with an accompanying 
monthly ten-minute joint planning session. Here again, the classroom and 
support program teachers were mandated to use separate and philosophically
distinct curricular approaches. Our observations of instruction and the plan-
ning conferences again indicated little coordination or communication about
instructional issues.

One might say that coordination cannot be wished, mandated, or legis- 
lated into existence. We offer several descriptions of school district efforts to 
attain instructional programs for at-risk students that, at least, involve the 
development of a coordinated effort and, in some, have produced programs 
we find hard to improve upon. Each of these programs is described briefly, 
followed by a discussion of the implementation process. 

North Warren Project. The North Warren, New York, school district 
is small and located in a very rural and economically depressed region. The 
effort in this district has been to redesign the reading and language arts 
program in the single elementary school. The effort was initiated when a new 
Chapter I teacher and new district superintendent were employed. The Chap- 
ter I teacher organized a literature-based remedial effort in an attempt to 
involve at-risk learners in enhanced amounts of text reading with an emphasis 
on understanding. Her initiative, with support from the superintendent, has 
developed into a redesign of the classroom reading and language arts curricu- 
lum along with an integrated approach to support instruction for at-risk 
learners. A similar spreading effect has been reported in the Ohio Reading 
Recovery Project (Pinnell, 1987).



The North Warren redesign was guided by three principles (Walmsley,
1986):



(First,) if genuine reading and composing are to be the primary language arts
activities, they should occupy the largest portion of the language arts curricu-
lum. . . . Second, . . . the notion of "skills first, application later" has been
replaced with the principle of skills through application. . . . Finally . . .
rather than offer [poor readers] a separate language arts curriculum . . . they
engage in the same reading and writing activities as their classmates, but we
offer them help both in the classroom, and in the remedial reading room, where
the books they have been reading in class are the primary vehicle for remedial
reading, and what they have been writing in class becomes the major vehicle for
remedial writing . . . (p. 5).

To put these principles into practice, the program uses primarily trade 
books, supplemented by shorter pieces of literature (fiction and nonfiction) 
drawn from basal reading series, magazines, newspapers, and elsewhere.
The teaching staff, both classroom and support teachers, collaboratively
select a theme and generate a list of books, articles, and excerpts on that
theme. Each theme is studied for about a month, with literary, science, and
social studies themes developed to this point. The list of texts on the topic
varies in difficulty with some materials below and others above grade level.
All teachers read all books and then discuss them and collaboratively develop
lessons and instructional activities. The skills of reading, such as "main
idea," are developed in the context of reading full-length literature rather
than through skill sheets or separate lessons. In addition to reading activities 
of various sorts, writing activities of several kinds are developed. These 
usually include at least short-answer writing to test comprehension, extended 
writing related to the reading theme, and writing of the students' choice 
related to the theme. Here again, the skills of editing, such as grammar, 
spelling, and handwriting, are developed in the context of the students'
writing rather than through separate exercises. 

The classroom program is organized for ninety-minute daily sessions 
with remedial assistance for at-risk learners being offered both during their 
regular classroom instructional time and also during pull-out sessions in the 
remedial reading room. Walmsley (1986) notes:

This project was originally designed for remedial readers, and it is in the 
approach to remediation of reading and writing difficulties that the greatest 
departure from traditional methods is evident. The philosophy of remediation is 
quite simple. Poor readers and writers need as much exposure to and practice in
reading full-length literature as their better reading counterparts, but they will
need more than ordinary assistance if they are to tackle this extended reading
(p. 6). 

However, Walmsley (1986) notes that the difficulty in developing and
implementing this approach should not be underestimated. Much planning 
and staff development time was expended during the three-year implementa- 
tion process. This approach requires that teachers develop compatible 
knowledge structures about reading instruction, classroom teachers become 
secure enough to allow support teachers into the classroom, and support 
teachers acquire the instructional and social skills needed for a collaborative 
effort. In addition, the support of the key district administrator seems critical 
for any number of reasons, but perhaps especially with fiscal support for the
development and implementation activities. However, dramatic improve-
ment of at-risk learners' literacy skills has been a central result (though
literacy levels of achieving students also improved) at North Warren.

Reading Recovery Program. The Reading Recovery Program is a 
highly effective early intervention program that was originated in New Zea-
land by Marie Clay (1985). It has since been replicated in several parts of the
United States, most notably Columbus, Ohio (Pinnell, 1987; Short, 1987).
There are several noteworthy characteristics of the program that make it
unique. First, it begins early, after one year of instruction in New Zealand, or
early in first grade in the United States. This is done in order to prevent the
child from building a history of failure and the characteristics associated with
that failure (e.g., Johnston and Winograd, 1985; Johnston, 1985). Second,
through one-on-one intensive daily instruction, the performance of the lowest
group of children is raised quickly (average of thirteen weeks) to the average
level of the remainder of the children in the regular class. These children are
then discontinued from the program and replaced by the next lowest children.
Discontinued children, even at long-term followup, do not need further assis-
tance (Clay, 1985; Pinnell, 1987). Third, the central component of the pro-
gram is an expert teacher who knows how to observe and record children's
literacy development and to tailor instruction specifically to the child's needs. 

The instructional activities themselves are characterized best by a
whole language philosophy. Virtually all of the daily thirty-minute session is 
spent reading and writing. A child will read or reread about three books each 
session. There is a balance of reading and writing activity with a focus on 
those aspects of the conceptual, procedural, and conditional knowledge in- 
volved in independent reading and writing activity. Independence is seen as 
critical in order to prevent regression when children are returned to the
classroom; thus self-correction rather than teacher correction is emphasized.
The instructional session is also characterized by a high pace while maintain-
ing the wait-time necessary to develop independence. This is made possible
because of the teacher's expertise in matching the child and the task so that
the error rate is optimal. There is a larger amount of easy reading to develop
automatic responses. Interestingly, there is little direct instruction or direct
phonics instruction. 

Coordination between regular and remedial instruction is not the focus 
of this program. Indeed, the remedial teacher has first choice of when to 
schedule which children, and the program is individually tailored regardless 
of the nature of the classroom program. The Columbus experience has shown
that the program is effective even when classroom practices conflict with it
(Pinnell, 1987). What can account for this apparent paradox? We have several
possibilities that need further investigation. The first possibility has to do with
the nature of the teacher expertise, which is the center of, or even the
entire, program. Teachers are trained to monitor children's development
and tailor instruction specifically to the individual child's response. Thus a
balancing procedure is built into the program. If a child tends to overpredict
in his reading, the tutor gives feedback aimed at focusing his attention on 
monitoring the print detail. If a child focuses too much on print detail, losing 
the meaning, the tutor's feedback focuses her attention on the meaning of the 
text. In other words, there is an attempt to base instruction on the curriculum 
as experienced by the child. 

Second, the whole language principles upon which Reading Recovery
is based may make it in essence a more "healing" program since whatever pieces
(or "skills") of literacy the learner brings, the program places them within a
context of natural language functioning. Indeed, if such a program produces a
clearer understanding of what it means to be literate, perhaps the children are
able to place classroom skills instruction into a more sensible context. The
detailed monitoring of each child's development prevents the development of 
inappropriate concepts about print and literacy while involving the child 
continually in reading genuine text for meaning. 

Third, the program focuses on flexible use of the various strategies in
reading. Thus diversity of responses is emphasized, which may help to undo
any rigidity that is built into the regular classroom program. At the same time,
the sheer volume of reading done during the sessions ensures automatic 
responses in the context of "real" reading. 

Nonetheless, those who are directing the program are greatly con-
cerned about ensuring the compatibility of the classroom and remedial pro-
grams, and have taken steps to ensure the development of this compatibility.
In particular, they have encouraged classroom teachers to collect particular
types of information about children's reading development. Teachers' views
of the children and how to teach them tend to revolve around how they
"know" the child (Johnston, 1987). Thus if teachers view the child in a
similar way, they are likely to teach in more compatible ways and to have
more grounds for discussion, which is likely to produce program conver-
gence. Initial results suggest that more congruent teacher pairs produce more
powerful effects (Pinnell, 1987).

Westwood Project. Although it shares some features with the North
Warren project, the Westwood (New York) project is historically older and
evolved quite differently (Lee et al., 1986). The use of trade book reading and
composition as primary support instruction activities, a mixed in-class and
pullout model, and a well-developed district philosophy for the integrated
teaching of the language arts are similarities that first come to view.
However, the impetus for coordination, the building-level special services
team, seems to stem from a board of education decision that each resident
child in this sprawling suburban district would have adequate and appropriate
educational opportunities available. This decision led to the creation of a
districtwide plan for instruction of at-risk learners.

To better provide appropriate educational services, the
special services team, comprised of classroom and support teachers, was
created. Its function is to review, with the classroom teacher, the issues 
involved when any child experiences learning difficulties. This building-level 
team decides on appropriate interventions, which range from assistance for 
the classroom teacher to remedial reading, mathematics, or writing services, 
a variety of special education services, and instructional planning assistance. 
The district uses multiple categories of funds to pay support teachers, so a 
reading teacher might be paid in part from Chapter I, state compensatory, 
local, and, perhaps, special education funds. Thus there is little quibble about 
who can work with whom. However, categorical identification of students 
is rare, or at least knowledge of categorical identification is rare. We found 
that only the central office administrator had reliable records on which stu- 
dents "belonged" to which categorical program. The classroom teachers, 
reading teachers, special education teachers, and principal had no reliable
knowledge of the categorical identification of the students with whom they
worked. What each individual knew was that Jimmy needed additional
instruction in reading, Buffy in math, and Wanda needed strategies to
control her emotional outbursts.

In short, the district philosophy was one of providing whatever instruc- 
tional support was needed without worrying about categorical identification. 
Thus reading teachers worked with identified learning-disabled children, a 
special education teacher worked with Chapter I children, and both
worked with children who needed support but were not eligible for any 
categorical program. The use of multiple funding sources to employ the 
support teachers blurred categorical differences. The addition of local funds 
to support the effort ensured that all children who needed assistance received 
it, whether or not they qualified for any categorical program. The general 
attitude in the district was simply to provide whatever instructional support
was deemed appropriate by the special services team and ignore categorical 
eligibility. 

The district has a long history of ongoing staff development and each 
elementary school has a building language arts coordinator. The language arts
coordinator is a full-time teacher who works with children, develops instruc-
tion with teachers, offers model teaching lessons in classrooms, orders all
materials, and directs all assessments of student learning. This position is the
building-level liaison to the district language arts cabinet that plans all staff
development, curriculum revision, and materials purchases, although other
teachers in the building will also serve on this cabinet. The cabinet, language
arts coordinators, and teachers are guided by an explicit and written district
philosophy of literacy development. 

However, unlike the North Warren project, support teachers do not 
necessarily provide additional instruction on the curricular materials used in 
the classroom. Rather, while they focus on increasing the amount of genuine
reading and composition activities for at-risk learners, they select materials
from a wide array of potential resources. Thus while these at-risk students
receive greater opportunities to engage in assisted contextual reading and 
writing, the coordination is more often at the skill or strategy level. By 
analyzing the classroom curricular materials and instruction, the support 
teachers work to create the "balance" we noted earlier. Collaboration be-
tween the support teachers is prevalent but not as prevalent as in North
Warren. Nonetheless, the resultant instructional support for at-risk learners
enhances opportunities to engage in genuine reading and writing activities
and provides additional instructional support for acquiring control over skills
and strategies necessary for success in the classroom reading and language 
arts curriculum. 

Here again, we saw the influence of a districtwide administrator on the 
development and implementation of a coordinated program. In addition, 
principals are involved in a variety of ways but are most obviously involved in 
working out scheduling difficulties. This could be a serious problem since the 
district also adheres to a plan whereby all students in need of any support 
services are scheduled out of the room during the same one-hour block of 
time. Thus in some rooms the at-risk learners leave from 9:00 to 10:00, while 
in other rooms they are absent between 1:00 and 2:00. Although difficult to
schedule (but not impossible, principals assure us), since a dozen support 
teachers' schedules need to be considered, this plan assures classroom teach- 
ers and all learners of a largely uninterrupted instructional day. 

Aurora Project. A variety of concerns about instructional support 
programs led to the redesign of Chapter I programs in the Aurora, Colorado, 
schools (Rickert, Ripple, and Coleman, 1985). These concerns included the
fragmentation of the school day for at-risk students who were pulled out of
classroom instructional settings for special services, lack of appropriate sepa-
rate instructional spaces with growing school enrollments, and the underutili-
zation of support teacher expertise in enhancing classroom reading and lan-
guage arts instruction. District and school administrators, as well as board of
education members, felt that the lack of coordination between instructional
programs hindered the educational development of at-risk learners (Rickert,
1987). Thus a district task force was created to study and recommend meth- 
ods for enhancing the coordination between instructional support programs 
and the regular education program. 

The proposed primary shift was to move from a pullout program to an 
in-class instructional support design. In the first year of the project, support
teachers were to implement the in-class design with at least one classroom 
teacher and were to begin to meet twice monthly with all classroom teachers 
whose children were served. These meetings were to be sketched on a 
monthly reporting form that was then submitted to the central office. The
support teachers were to find ways that their instruction could be coordi-
nated with the classroom program, and a variety of such strategies were de-
veloped.

However, there were significant implementation problems that year. 
Among those most common were: the difficulty of finding common free 
periods to meet twice each month, the refusal of some classroom teachers to 
meet for cooperative planning, philosophical differences about how reading 
is best taught, personality conflicts between the teachers, the difficulty some 
classroom teachers had accepting another teacher in the room, and, finally, 
the difficulty in establishing two-way communication, primarily getting 
classroom teachers to participate actively in the planning, as opposed to a 
passive listening participation. As a result of these difficulties a variety of 
staff development programs were offered, including programs that provided 
support instruction teachers with improved interpersonal skills necessary for 
collaboration, programs on the philosophy of coordinated instruction and 
in-class models, and opportunities for teachers to visit classrooms where 
in-class models were working well. Over time, the in-class component and 
the collaborative planning have been well integrated into the school system 
and teacher acceptance, although not universal, is substantial. 

There were several building-level features and options that enhanced 
acceptance of the redesign. First, when building administrator support was 
clearly available, the transition was more easily accomplished. When 
building administrators assisted in planning meeting times and instructional 
interventions, implementation went more smoothly. Provisions that allowed 
teachers to decide who would work together also seemed important. When- 
ever two teachers who did not want to work together were forced to do so, 
little communication, coordination, or collaboration was evident, even 
though meetings were held as required and instruction was delivered in the 
mainstream classroom. 

Currently, observations of instruction suggest coordination is most 
generally evident. Even in those cases where the pullout model is used, the 
communication between teachers has enhanced the coordination of instruc- 
tion. In the in-class efforts, coordination is also easily evident and is higher 
than in the early stages. However, it seems that communication between 
teachers is the primary factor, not necessarily the location where instruc-
tional support is offered (Rickert, 1987). In both the elementary and middle
schools, teacher acceptance of collaboration and in-class support instruction 
is high. As both sets of teachers have become more experienced and knowl- 
edgeable about collaborative planning, the process has improved and gained 
support (Reed, 1987). 

The Aurora experience again indicates the need to consider program 
redesign as a long-term and ongoing process. The success can be traced to the 
administrative support for the effort, from both district- and school-level 
administrators. Staff development, flexibility, and experience all worked to 
facilitate implementation over the three-year period.



SUMMARY AND CONCLUSIONS

We have reviewed the available research on instructional coordination and 
tried to clarify what coordination is, what it does, and what is likely to 
maximize its occurrence. Our particular focus in this chapter has been on the
lowest level of coordination (classroom and support program instruction),
that which impacts the individual student's learning directly. We have exam-
ined coordination as it influences the quality of instruction delivered to
students. We have argued that current evidence suggests many at-risk learn- 
ers do not currently have access to instructional settings that offer larger 
amounts of high-quality literacy instruction driven by a consistent curricular 
approach. Rather, the least able students are those most likely to participate
in instructional settings that offer reduced amounts of instruction generally 
and a fragmented curricular experience frequently. Those who need the most 
carefully organized instruction receive, instead, a virtually incoherent mix- 
ture of competing curricular approaches and emphases. Such need not be the 
case, as indicated by the brief case studies of well-planned support programs.

Our experience suggests that coordinated efforts are not easily 
achieved, and most difficult may be obtaining shifts in the content of instruc- 
tion offered in support of regular programs, in order to produce an integrated 
program of instruction for the individual student. Studies that have examined 
coordination among regular and various support programs, regardless of 
definition, have invariably found it rare, but where evident, important to 
children's learning. However, we have found that there is little clear descrip-
tion of exactly what coordinated instruction would look like. Although we
have made an attempt, there are currently substantial ideological differences
concerning methods of reading instruction, and thus we see a need for re-
search directed toward clarifying these issues. However, whatever the ideol-
ogy, it does not seem unreasonable that within a given school, for a given
pupil, the instruction be coherently articulated. 

The major issue really comes down to coordination of the curricula as 
they are experienced by the student. That is, the objective is to set up a
situation in which students find that support instruction makes it easier to 
make sense of the regular instruction. 

We feel that curricular coordination may be further encouraged at the
local level through some of the following: 

1. Explicit and cooperatively developed curricula with similarly explicit 
rationales would seem likely to enhance instructional congruence. 

2. Explicit curricula would help, but these would certainly not guarantee 
coherently articulated instructional services. A serious stumbling block 
remains in the form of teachers' knowledge structures, or what might be 
termed the "implicit curriculum." This implicates teacher training and 
programmatic teacher inservice development. Teacher training initially
develops these knowledge structures, and subsequent teacher isolation,
issues of "turf," and the other institutional and policy issues discussed 
perpetuate the differences. 

3. Communication among teachers, both regular and support, might be 
fostered in a variety of ways, including requiring regular conferences 
between teachers involved with specific special needs children. Time 
must be specifically allocated to this activity but time allocation is not 
enough. All teachers will have to acquire a sense of ownership of the
targeted student and his or her instruction. Cooperative planning may not 
be easily accomplished but it must begin. 

4. Observation of at-risk student performance in another instructional set- 
ting is likely to provide better knowledge of different components of the 
child's actual curriculum, and concrete grounds for discussions between 
teachers. 

5. Continuous process records of children's development may be very 
helpful in inducing curricular alignment. For example, such records 
might contain samples of the child's writing, and running records of 
reading behaviors (Clay, 1985) over time. If teachers are required to keep 
such records and are provided with the expertise to do so with minimal
additional effort, the nature of the files will focus on instruction, and the 
data within the files will give a more concrete foundation for teachers' 
discussions of students' progress and difficulties. 

We have argued that, in addition to attempts to improve coordination, it 
would be helpful to have concurrent progress toward reducing the demands 
on coordination. In particular, we have stressed the need for: 

1. Explicit district policies supporting coordination

2. Early attention to learning difficulties with an emphasis on prevention 

3. Intensive intervention in order to eliminate the need for support services
as quickly as possible 

4. Low teacher-pupil ratios, particularly in the early stages, and even one- 
to-one support instruction where possible 

5. Support programs that focus on student independence so that students
might return to the regular instruction and maintain progress 

6. Support programs that focus on service quality more than categorical 
identification and regulatory compliance. 

Achieving the type of support instruction we envision is not an easy 
task. Simple mandates to coordinate do not work. Simple accountability 
measures, such as a written log of planning conferences, do not work. In 
order to achieve the current level of coordination observed, our case study 
districts invariably worked over extended periods of time to develop expertise
in both classroom and support teachers and to develop a local design that
worked with the particular features of that district. The impetus for the 
redesign in each case, however, was concern for the academic experience of 
the individual at-risk student. 

Each of the chapters in this volume reviews a considerable portion of the
research on effective programs for students at risk of school failure, and
among them hundreds of conclusions are drawn. However, there is a small
set of themes that run through these numerous conclusions and that have the
greatest importance for practice and policy. These are as follows:

1. The setting within which remedial or special education services are
provided makes little difference. What does matter is the quality of the
programs implemented in the setting. 

2. Prevention and early intervention are much more promising than waiting
for learning deficits to accumulate and only then providing remedial or special 
education services. 

3. Effective classroom and pullout programs for students at risk tend to 
accommodate instruction to individual needs while maximizing direct
instruction. They frequently assess student progress through a structured
hierarchy of skills, and adapt instructional strategies to the results of these
assessments. Highly effective and cost-effective classroom programs include
continuous-progress and cooperative learning models. When pullout programs
are used, they should be intensive (e.g., one-to-one tutoring or
computer-assisted instruction), brief, and designed to quickly catch students 
up with the rest of the class, not to support them indefinitely. 

4. Preschool and extended-day kindergarten programs can contribute to the
cognitive and social development of children from low-SES backgrounds.
Effective preschool programs tend to emphasize exploration, language
development, and play, not academics. Effective kindergarten programs build
language and prereading skills using structured, well-organized,
comprehensive approaches.

5. Remedial and special education services are too often poorly integrated 
with the regular education program, and therefore burden at-risk students 
with the task of accommodating very different approaches. Collaboration and 
consistency between regular, remedial, and special education are essential. 

6. Teacher behaviors associated with outstanding achievement gains for 
students in Chapter I pullout programs and for mainstreamed academically
handicapped students in regular classrooms tend to be similar to behaviors 
found to be effective with all students. This and other research tends to 
suggest that effective practices for students at risk tend not to be qualitatively 
different from the best practices of general education.



SUCCESS FOR ALL: A COMPREHENSIVE 
DESIGN FOR REFORM OF REMEDIAL AND 
SPECIAL EDUCATION 

In addition to the specific conclusions just outlined, three more general 
conclusions should be abundantly clear from the preceding chapters. First, 
the remedial and special education programs now in widespread use are not
doing enough to overcome the problems of students at risk for school failure. 
Second, we know enough about effective programs and practices for students 
at risk to increase substantially the number of students who leave the elemen- 
tary school with a firm basis in the basic skills. Third, we still have a long way 
to go in translating research on effective practices for students at risk into 
replicable models and in learning how to successfully disseminate and
implement such models on a broad scale.

Major changes are needed in our approach to education of students at
risk for school failure. Although many improvements can and should be made 
within the programs most typically seen today, we believe that to make a 
marked difference in the chance that at-risk students will succeed in school 
and in life, fundamental changes are needed in the basic organization of the
elementary school. 

In this section, we propose a model of elementary school organization 
that incorporates much of what we have learned about effective programs for 
students at risk. This model, which we call Success for All, was piloted and 
evaluated at one Baltimore City elementary school during the 1987-88 school 
year, and is being expanded to additional schools during the 1988-89 school
year. The program elements are being introduced gradually over time, and
many of the objectives of the model could be accomplished differently in 
different schools, so the model is described here both in its complete concep- 
tion and as it exists at this writing. 

The Pilot School 

The Success for All model is currently being implemented at Abbottston 
Elementary School, a pre-K to 5 school with approximately 440 students.
Almost all students are black; 76 percent receive free lunch. The school was 
selected from among the category of schools that receive the most intensive 
Chapter I resources in the city of Baltimore. In addition, the school had to 
meet a set of criteria specifying lack of other major programs, adequate 
space, location near Johns Hopkins University, and manageable size. In 
spring 1987, the principal and school staff were given an opportunity to serve 
as the Success for All pilot site, and they voted unanimously to do so. The
intention in the selection process was to locate a school that served a typical
inner-city population with a staff that was typical of Baltimore City elemen- 
tary school teachers but willing and able to work with our staff to refine and 
implement the model. The program at Abbottston Elementary is being com- 
pared to a matched control school. 

Program Design 

The main organizing principle behind the Success for All design is that no 
child is permitted to fall behind in basic skills. Resources are concentrated in 
the early grades to attempt to ensure that every child is successful. 

Three essential ideas about school and classroom improvement are
central to the design of Success for All. The first premise is that the best place 
to work on ensuring success for all students is in the classroom and its 
instructional program. Further, it is assumed that research-based program- 
matic efforts will have the highest chance for achieving the important goal of 
ensuring success for all. That is, rather than focusing on single variables, such
as learning time, parent involvement, or reading methods, the improvement
efforts must be focused on implementation of strategies that successfully
address all major components of learning in classrooms. 

The second essential idea is the importance of responsiveness to 
students' needs for corrective instruction. When students experience diffi- 
culties, they must receive specifically targeted assistance right away, not 
weeks or months later when small deficits have accumulated into large ones. 
The timing of remedial efforts as well as the integration of these efforts with 
regular instruction is seen as a critical factor in ensuring success for all 
students. Incorporation of specific structures for tutoring students who are
not keeping up with their classmates and for frequent assessment, feedback,
and correction addresses the need to catch problems early.

The third essential idea is the need for the flexible use of school re- 
sources, in particular time and personnel. In most schools, a set level of 
resources is provided to students, and student success is allowed to vary. In 
Success for All, success is seen as an entitlement for all students, and it is the
resources needed to ensure success that are allowed to vary. Student learning
deficits can first be addressed by the reading teacher. If this is not enough, the
student may be assigned to a reading tutor. If the student's problem is due to a
problem at home, a family support team is available to work with families and
social agencies. If health or psychological services are needed, they are
provided. Few if any students will be assigned to special education, and few if
any will be retained. It is seen as the responsibility of the school to see that
every child succeeds in the regular program, no matter what their needs 
may be. 



Program Elements 

Reading Tutors. One of the most important elements of the Success
for All model is the use of tutors to support students' success in reading.
Tutors replace Chapter I and special education resource teachers; research
on class size suggests that one-to-one instruction is far more effective than
instruction given to small groups (see Glass, Cahen, Smith, and Filby, 1982;
Slavin, in press). The tutors are certified teachers with experience in teaching
Chapter I, special education, and/or primary reading. Tutors work one-on-one
with students who are having difficulties keeping up with their reading
groups. Students are taken from their homeroom classes by the tutors for
twenty-minute sessions during times other than reading or math periods. In
general, tutors support students' success in the regular reading curriculum,
rather than teaching different objectives. For example, if the regular reading
teacher is working on long vowels, so is the tutor. However, tutors seek to
identify learning deficits and use different strategies to teach the same skills.

During daily ninety-minute reading periods, tutors serve as additional 
reading teachers to reduce class size for reading. Information on students' 
specific deficits and needs passes between reading teachers and tutors on brief
forms, and reading teachers and tutors are given regular times to meet for
purposes of coordinating their approaches with individual children.

Initial placements in tutoring are made based on informal reading
inventories administered at the beginning of the school year. Subsequently,
decisions about who receives tutoring are made based on eight-week
assessments, which include teachers' judgments as well as more formal
assessments. First graders receive first priority for tutoring, on the assumption that
the primary function of the tutors is to help all students be successful in
reading the first time, before they become remedial readers. 

Part of the role of the tutors is to identify effective teaching strategies
that the regular classroom reading teachers may apply. Many students
receive tutoring for one eight-week block and then remain with their class the
next eight weeks to see if the regular teacher can keep the child on track
without tutoring. 

At Abbottston Elementary, a total of six tutors are being provided for
grades K through 3, a total of approximately 300 students. They each work 
with a total of eleven students per day, so about 22 percent of all students in 
grades K through 3 are receiving tutoring at any given time. However, since 
tutoring services are concentrated on first graders, approximately 40 percent
of first graders are receiving tutoring. Since homeroom classes are reduced to
twenty-five students in grades 1 through 3, use of tutors as reading teachers 
reduces reading class sizes to about fifteen. 
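
The overall tutoring share reported above follows from simple arithmetic on the stated staffing figures. The sketch below reproduces it; the function and variable names are illustrative assumptions and are not part of the Success for All program.

```python
def tutoring_coverage(num_tutors: int, students_per_tutor: int, enrollment: int) -> float:
    """Return the share of enrolled students being tutored at any given time."""
    if enrollment <= 0:
        raise ValueError("enrollment must be positive")
    return num_tutors * students_per_tutor / enrollment


# Figures reported in the text for grades K through 3 at Abbottston Elementary.
tutored = 6 * 11  # six tutors, eleven students each per day
coverage = tutoring_coverage(num_tutors=6, students_per_tutor=11, enrollment=300)
print(f"{tutored} students tutored, {coverage:.0%} of enrollment")
```

The same calculation cannot be repeated for the first grade figure, because the text does not report first grade enrollment or the number of first graders tutored.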

A Program Facilitator (see below) supervises the tutors' activities, 
sitting in with tutors to observe tutoring sessions, suggesting or modeling 
alternative approaches for specific students, and so on. 

Program Facilitator. A Program Facilitator works at the school full 
time to oversee (with the principal) the operation of the Success for All model. 
The Facilitator helps plan the Success for All program, helps the principal with 
scheduling, and visits classes and tutoring sessions frequently to help teach- 
ers and tutors with individual problems. The Program Facilitator may work 
with individual children having particular difficulties to find successful 
strategies for teaching them, and then return the children to the tutors or
teachers. She or he helps teachers and tutors deal with any behavior prob- 
lems or other special problems, and coordinates the activities of the Family 
Support Team with those of the instructional staff. 

Reading Program. Students in grades 1 through 5 are regrouped for
reading. That is, students are assigned to heterogeneous, age-grouped classes
with class sizes of about twenty-five most of the day, but during a regular
ninety-minute reading period they are regrouped according to reading
performance levels into reading classes of about fifteen students all at one level.
For example, a 2-1 reading class might contain first, second, and third grade
students all reading at the same level. Beginning at midyear, kindergarten
students are regrouped in the same way. Initial placements in reading classes
are made on the basis of individually administered informal reading
inventories, but placements are revised every eight weeks on the basis of
assessments of progress through the reading curriculum.
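
The regrouping idea can be illustrated with a small sketch: students from different homeroom grades are collected into one reading class per reading level. The roster, the names, and the "1-2" level label are hypothetical, and the code is only an illustration of grouping by level, not the procedure used at the pilot school.

```python
from collections import defaultdict


def regroup_for_reading(students):
    """Group (name, grade, reading_level) records into one class per reading level."""
    classes = defaultdict(list)
    for name, grade, level in students:
        classes[level].append((name, grade))
    return dict(classes)


# Hypothetical roster; "2-1" follows the text's notation for a reading level.
roster = [
    ("Ana", 1, "2-1"),
    ("Ben", 2, "2-1"),
    ("Cal", 3, "2-1"),
    ("Dee", 2, "1-2"),
]
reading_classes = regroup_for_reading(roster)
for level, members in reading_classes.items():
    print(level, members)
```

In this illustration the 2-1 class holds a first, a second, and a third grader, as in the example given in the text. Rerunning the grouping with updated levels would correspond to the eight-week revision of placements.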

The idea behind regrouping is to allow teachers to teach the whole 
reading class without using reading groups within the class. This greatly 
reduces the time needed for seatwork and increases direct instruction time.
This regrouping plan is a form of the Joplin Plan, which has been found to
increase student achievement in reading (see Slavin, 1987).

The reading program itself (Madden, Slavin, Karweit, Livermon, and
Stevens, 1987) has been designed to take full advantage of having ninety
minutes of direct instruction. The reading program emphasizes development
of basic language skills, auditory discrimination, and sound and letter
recognition skills in kindergarten, and uses an approach based on sound blending
and synthetic phonics starting in first grade (although kindergarten students
who show readiness are accelerated into the first grade program). Peabody
Language Development kits are used in grades pre-K, K, and 1 to help
students build language concepts. The reading program emphasizes oral
reading to partners as well as to the teacher, instruction in story structure and
specific comprehension skills (especially in grades 2 and 3), and integration of
reading and writing. At the kindergarten and first grade reading levels,
students use phonetic mini-books. When they reach the 2-1 reading level,
students begin to use the district's Macmillan basal series. The program for
grades 2 through 5 is essentially Cooperative Integrated Reading and
Composition or CIRC (Stevens, Madden, Slavin, and Farnish, 1987), which has been
found to increase student reading achievement in two field experiments.

Language Arts/Writing. Language arts and writing instruction are
given in the homeroom class. The emphasis of the language arts/writing
program is on development of oral language skills in preschool and
kindergarten, and development of creative and expository writing skills in grades 1
through 5. Students are taught to write using a writing process model in which
they plan, draft, revise, edit, and ultimately publish compositions. In the
early grades, students are encouraged to use invented spelling, applying letter
sounds and phonics skills they have learned in their reading program.
Language mechanics instruction (e.g., capitalization, punctuation) is provided in
the context of students' writing and in specific lessons provided based on
students' needs. Spelling and handwriting instruction is also being provided
in the language arts/writing period. The language arts/writing program is
adapted from Cooperative Integrated Reading and Composition (Stevens et
al., 1987).

Mathematics. Although implementation of the mathematics component
of Success for All has not yet begun at the pilot school, plans call for a
mathematics program emphasizing manipulatives, diagrams, and other means
of making mathematical concepts real to students, and problem solving, 
estimation, and use of calculators. Beginning in grade 3, students will be 
regrouped for mathematics, and will begin to use a form of cooperative 
learning based on Team Assisted Individualization (Slavin, 1985). 

Eight-Week Reading Assessments and Individual Academic
Plans. Every eight weeks, reading teachers assess student progress
through the reading program. Any students who seem to be having difficulties
keeping up or who may be candidates for acceleration into a higher reading 
group are referred to the tutors for individual assessments on Informal 
Reading Inventories (IRIs). The results of the assessments are used to deter- 
mine who is to receive tutoring, to suggest other adaptations in students' 
programs, and to identify students who need other types of assistance, such 
as family interventions, vision/hearing screening, or (in extreme cases)
referral for special education evaluation. On the basis of the eight-week
assessments, Individual Academic Plans (IAPs) are developed for each student
indicating areas of weakness to be addressed by classroom teachers and/or
tutors. 

Class Size. Class sizes in the Success for All school are held to
twenty-five in grades 1 through 3, twenty in kindergarten, and fifteen in
pre-K.

Preschool. The Success for All school provides a half-day preschool 
for all eligible students. Class size for preschool is held to fifteen students, 
with one teacher and one aide. The focus of the preschool is on providing a 
balanced and developmentally appropriate learning experience for
four-year-olds. The curriculum places a heavy emphasis on the development and use of
language. Specific structured approaches to language development with
demonstrated effectiveness are being integrated into the curriculum.

The curriculum itself follows a thematic approach, integrating various 
prereadiness activities. A curriculum manual with specific suggested
activities has been written and forms the basis for teacher training. One important
element of the program emphasizes structured story telling and retelling,
which is designed to enhance students' comprehension as well as expressive
and receptive language skills. Peabody Language Development kits are used
to build students' language concepts and oral expression.

Kindergarten. The kindergarten program is a full-day program with 
class sizes of no more than twenty children, with one teacher and one aide in
each class. The major focus of the program is on providing a balanced 
curriculum of academic readiness and nonacademic activities of music, art, 
and movement. 

Continuing the approach in the preschool, a major emphasis in the 
kindergarten is on language use and development. Strategies for stressing
language as a critical part of prereading readiness are incorporated into the
curriculum. These include the story telling and retelling program begun in
preschool and use of the Peabody Language Development kits, as well as
training teachers to use every available opportunity to promote students' use
of elaborated, descriptive language. Prereading instruction in the
kindergarten emphasizes reading to students, teaching concepts about print, auditory
discrimination training, and letter sounds and sound blending. 

All students entering kindergarten spend the first half of the year in their 
regular kindergarten classroom with their regular teacher. This half year 
allows for settling in to the routine of a full day at school and to the routine of 
academic activities. At the start of the second half of the year, those
kindergarteners who are ready to do so move into the reading program itself, while
others continue with the readiness program in their regular classroom. 

The afternoon program in the kindergarten is not focused directly on 
academics. There is a period, however, where additional help might be given 
to students who did not understand the morning's lessons. This is not a 
one-on-one tutoring session, but a small group session where the teacher or
aide goes over specific material with a group of students. 

Family Support Team. A Family Support Team consisting of at least 
one social worker and one paraprofessional parent liaison works full-time in
the school. The Family Support Team provides parenting education and 
works to involve parents in support of their children's success in school, both 
in setting up a home reading program in which students read to their parents 
and in establishing opportunities for parents to volunteer within the school. 
Also, family support staff are called on to provide assistance when there are
indications that students are not working up to their full potential because of
problems at home; for example, families of students who are not receiving
adequate sleep or nutrition, are not attending school regularly, or are
exhibiting serious behavior problems receive family support assistance.

Special Education. Every effort is being made to deal with students' 
learning problems within the context of the regular classroom, as supple- 
mented by tutors. Students who would have ordinarily been referred to 
special education resource programs are instead retained in class and helped
by tutoring and family support services. A self-contained special education
program is still provided, but very few referrals are made to the program.

Advisory and Steering Committees. An advisory committee
composed of the building principal, Program Facilitator, teacher representatives,
and Johns Hopkins staff meets weekly to review the progress of the program
and to identify and solve any problems that arise. In addition, a steering
committee composed of Baltimore City administrators, the principal,
Facilitator, and Johns Hopkins staff meets monthly to resolve any system-level
concerns and to keep the administration up-to-date on the progress of the 
program. 

Costs. In its full form, the Success for All program can be expensive.
However, in disadvantaged schools receiving high levels of Chapter I funds
(such as Abbottston), the costs can be little or no more than those already
provided by Chapter I and special education programs, because the Success
for All model replaces existing Chapter I and special education pullout
services. The most expensive single element is the provision of preschool
education; in Baltimore, the cost of providing a teacher and an aide to a
half-day class of fifteen is more than $1,850 per student per year in salaries
alone. The extended-day kindergarten program is also very expensive. If
these can be provided by other funds, the other elements of Success for All
can typically be implemented with current levels of Chapter I and special
education monies, as long as legal requirements regulating use of these funds 
are satisfied. 
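
To give a sense of scale, the per-student preschool figure can be converted into a per-class salary figure. The sketch below is a rough illustration only: the text gives $1,850 as a lower bound ("more than"), so the result is likewise a lower bound, and the function name is an assumption made for this example.

```python
def class_salary_cost(per_student_cost: float, class_size: int) -> float:
    """Return the yearly salary cost for one class, given a per-student cost."""
    return per_student_cost * class_size


# Lower-bound figures from the text: $1,850 per student, half-day class of fifteen.
preschool_class_cost = class_salary_cost(1850, 15)
print(f"At least ${preschool_class_cost:,.0f} per half-day class per year in salaries")
```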



Evaluation Results 

Implementation of Success for All at Abbottston Elementary began in
September 1987. The program is expected to remain at the school for at least five
years. The program evaluation compares Abbottston students to those in a 
nearby school matched on socioeconomic status and historical achievement 
level. Then individual children were matched on the basis of standardized test 
scores. Fall Boehms and Metropolitans were used to match preschool and 
kindergarten students, respectively, and spring CATs were used to match 
students in grades 1-3. 

For more details on the research procedures and findings, see Madden,
Slavin, Karweit, Livermon, and Dolan, 1988.

Preschool and Kindergarten. Preschool results indicated that
Success for All children scored significantly higher than control on the Test of
Language Development (TOLD) Picture Vocabulary and Sentence Imitation
Scales, and on the Merrill Language Screening Test's Comprehension scale,
with effect sizes ranging from 0.44 to 0.66. Success for All kindergarteners
outscored control students on the TOLD Sentence Imitation and Grammatic
Completion scales, the Woodcock Letter-Word Test and Word Attack 
scales, and the Merrill Language Screening Test. Effect sizes ranged from 
0.47 to 0.71 except for Word Attack, which had an effect size of 3.74. 

First Grade. Across five scales taken from the individually
administered Woodcock and Durrell reading inventories, Success for All first
graders scored at an average grade equivalent of 2.0 (50th percentile), in
comparison to 1.5 in the control group (28th percentile). Effect sizes ranged
from 0.34 to 1.39. Among
students who scored in the lowest 25 percent on the pretests, Success for All
students scored at the 32nd percentile, in comparison to the 8th percentile for
similar control students. These results are depicted in Figure 12-1. As the
figure shows, the lowest 25 percent of the Success for All first graders
outscored the average control students.

FIGURE 12-1. Mean Reading Scores for Success for All and Control Schools

Second Grade. At the second grade level, Success for All students
significantly outscored control on the Woodcock Letter-Word Identification
and Word Attack scales, but not on the Durrell scales. Similar results were
obtained for the lowest 25 percent of students. 

Third Grade. The strongest effects of all were seen at the third grade 
level. Success for All third graders averaged 3.6 grade equivalents (47th 
percentile), in comparison to their counterparts in the control school, who 
averaged 2.4 (17th percentile). Effect sizes ranged from 0.71 to 1.42. As in the
first grade, the lowest 25 percent of third graders in Success for All outscored
the average in the control school. The lowest-scoring 25 percent of Success
for All students scored at the 19th percentile, in comparison to their matched 
counterparts who scored at the 2nd percentile. 

Retentions and Special Education Referrals. Some of the most
important effects of Success for All do not appear in the standardized test
scores, but in the ability of the school to maintain students in the regular 
program. At Abbottston, approximately 12 percent of students in grades 1-3 
were retained the year before the program began. At the end of the first 
program year, only one student was retained. In the previous year, thirty
students were referred to special education and eighteen were accepted. 
Currently, two students diagnosed as retarded were the only ones assigned 
to special education for a learning problem. As noted earlier, Abbottston lost 
its special education resource teacher because of the program. 

In the first year, Success for All brought the average scores for children
at all grade levels to almost the 50th percentile (the control school averaged
at about the 28th). This is good, but it does not yet fulfill the commitment to
bring every child to grade level in the basic skills. Not until the current
preschoolers have reached the third grade can this commitment be assessed.
Also, long-term effects of the program must be determined to see if early
success does in fact eliminate or greatly reduce the need for continuing 
remedial or special education services. 

If, however, Success for All is ultimately found to meet its promise to
provide all children with adequate skills, the implications for compensatory 
and special education could be dramatic. If it could be shown that high rates 
of learning problems are not inevitable consequences of poverty but could be 
prevented by the schools, the political calculus surrounding compensatory 
education would be greatly altered. Withholding proven, effective interven- 
tions from students at risk might be seen as tantamount to withholding 
effective medications from children with curable diseases. Special education 
could get out of the extremely expensive and largely ineffective business of 
serving large numbers of students categorized as "learning disabled" and 
instead concentrate its efforts on the truly handicapped. 

The success and widespread adoption of Success for All could also have 
many less immediately apparent consequences. First, it would reverse the 
growth of special education programs for the learning disabled. All but the 
most disabled (or behaviorally disordered) students could be served within
the regular classroom setting, supplemented by tutoring. This would allow
most students with learning disabilities to receive appropriate and effective 
services without the expensive and cumbersome special education assess- 
ment and referral processes now in use. Special education could return to a 
focus on truly "special" cases, where a child would come to the attention of 
the special education process only after appropriate placement in reading and 
mathematics, optimal classroom instruction, tutoring, family support inter- 
ventions, and other interventions within the regular education program had 
been tried and had failed. 

The impact of Success for All on remedial and compensatory education 
would be equally profound. Pullouts for remedial services would be brief and
intensive. Tutoring sessions and decisions about who should receive tutoring 
would be made every six weeks. In this way, few students would receive 
remedial services every day for even a whole year— much less year after 
year, as so often happens under current practices. Chapter I would no longer 
be a separate program housed within the school, but would become an
integral part of the school's attempts to provide all students with an ade-
quate level of basic skills. Every teacher in the elementary building,
Chapter I-funded or not, would teach reading, so that teachers would have a
common language and experience base from which to work together to
ensure student success. This use of Chapter I funds is permitted under the
recently passed Chapter I reauthorization for schools with at least 75 percent
of their students receiving free lunch, and could be adapted to work within 
less disadvantaged schools. 

Widespread adoption of Success for All models could also have impor- 
tant effects on regular education. Even students who never received a day of 
tutoring or other services would benefit from a school organization plan that
freed the teacher from constantly having to deal with students who lack basic
skills and removed the constant disruptions inherent to remedial and special
education pullouts. The opportunity for bright students to be grouped with
older students for reading and mathematics would allow for vertical acceler- 
ation in these subjects without the drawbacks of comprehensive tracking or 
separate gifted programs. 

Finally, if Success for All is, in fact, successful, this could pose an
important challenge to the political and legal status quo. Imagine that it could
be demonstrated that virtually all students could, in principle, be successful
in school, given a level of resources greater than that which is now available
but within a range that local, state, or federal governments could provide. In
this circumstance, withholding proven, effective services capable of pre-
venting educational failure could be perceived as similar to withholding
effective antibiotics from children with bacterial diseases. At present, it is
difficult to prove convincingly that per-pupil expenditures are strongly re-
lated to achievement, and because of this, court cases challenging school
finance policies are typically unable to establish that gross disparities be-
tween suburban and inner-city schools result in educational disadvantages
for the latter. If it could be shown that money spent on ensuring the success of
every child in the elementary grades could have a substantial and lasting 
impact, then there may arise political and legal pressures to provide every 
child with enough services to bring them to an acceptable level of basic skills. 



POLICY RECOMMENDATIONS 

How can programs for students at risk of school failure be reformed to make a
substantially greater impact on student achievement? The chapters in this
book have identified critical changes in practice at many levels, from changes
in teacher behaviors to changes in school and classroom organization to
changes in instructional programs to changes in legal and political systems.
The preceding section described a model elementary school program that
operationalizes many of these changes. However, beyond the particular
reforms suggested in the chapters, there are broader policy changes that are
needed at the federal, state, and local levels to bring about lasting changes in
the ability of remedial and special education programs to meet the needs of
at-risk students. 

These policy changes flow from two observations about the current 
status of programs for students at risk. First, there is much more we need to 
know about how to ensure success for low-achieving students. Second, that
which we do know is having little effect on current practice in remedial and
special education, and there are few mechanisms in place for translating
research into practice. 

The remainder of this chapter lays out a set of actions which, if success-
fully undertaken, would result in a major improvement in the effectiveness of
remedial and special education. 

Establish National Program Evaluation Centers 

One of the most vexing problems in translation of research into practice is 
that it is difficult for practitioners or policymakers to assess the adequacy of 
program evaluations. All programs that are disseminated claim to be effec-
tive, but it is often difficult to determine the validity of these claims. The Joint
Dissemination Review Panel (JDRP), an ad hoc group within the U.S. De-
partment of Education, does examine evaluation data and passes programs
that appear to be effective. However, the quality of research designs required
to pass JDRP is very low. For example, most programs claiming effects on
achievement used fall-to-spring NCE or percentile gains, or other pre-to-post
gains as the criterion for effectiveness, and these designs have been found to
produce greatly inflated estimates of program effects (see Chapters 1 and 2).
Also, the JDRP has no way to determine whether or not it is seeing all
evaluations of a given program. A program might fail in nine sites and succeed
in one, and the program could be approved based on the one site. At present,
there is no requirement that all evaluations be submitted to JDRP.

Although reforming JDRP rules would be helpful, the JDRP system is 
fundamentally incapable of identifying programs that could be counted on to 
produce positive achievement effects. There are just too many ways that a
program developer evaluating his or her own program can, in all good faith,
bias the results in favor of program success. For example, developers can key
their program to the limited set of objectives assessed on the criterion mea-
sures, they can choose measures particularly likely to favor their curriculum,
they can use as control groups schools or teachers that refused the program
and can therefore be assumed to be less motivated, they can devote extraor-
dinary amounts of staff time in a small number of classrooms to ensure that
the program will work, and so on.

If we are to expect federal, state, and local educational agencies to
invest seriously in reforming programs for students at risk, we must have the
utmost confidence that we have a set of programs which, if properly imple-
mented, will make a significant difference in the achievement of students at
risk. This state of affairs will only come about when we have independent,
rigorous, and widely recognized program evaluations, much as the Food and 
Drug Administration oversees independent evaluations of new drugs or Con- 
sumer Reports evaluates consumer goods. 

We propose that the Department of Education establish one or more 
independent evaluation centers to evaluate programs that could be used 
under Chapter I or special education funding. These centers would select
programs that had already been shown to be successful by their developers
(e.g., programs that had already passed JDRP) for independent evaluation.
They would locate some number of school districts willing to evaluate the
programs. Each district would make available two or more similar schools
whose principals and staffs had volunteered to use the program, and the
evaluation center would randomly select one set of schools to use the pro-
gram and one set to serve as a control group. The program developer would
receive funding to train teachers, local support staff, and so on, to work in
the experimental schools. Students in the experimental and control schools
would be pre- and posttested on measures specially designed or adapted by
the evaluation center. Although program developers would know what objec-
tives the measures assessed, they would not have access to the measures
themselves (to keep them from gearing their program to the tests). These
measures would assess achievement broadly, and need not be restricted to
paper-and-pencil assessments. For example, individually administered
reading assessments are essential in this area; a subset of students could be
given such assessments. In mathematics, higher-order problem-solving and
estimation skills might be assessed using nontraditional formats, along with
more traditional measures of computations and simple applications. Matrix
sampling procedures might be used to achieve broad-scale assessment with-
out taking too much time for testing. For example, in assessing a writing
program, one-third of the students might be randomly assigned to write a 
personal narrative, one-third an expository essay, and one-third a mystery 
story. 
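
The assignment and sampling steps just described can be illustrated with a short sketch. It is only an illustration under stated assumptions: the function names, the even split between program and control schools, and the seeded random generator are hypothetical choices, not procedures specified in this chapter.

```python
import random


def assign_schools(volunteer_schools, rng):
    """Randomly split one district's volunteer schools into two sets.

    Assumption: schools are divided as evenly as possible; with an odd
    number of schools, the control set receives the extra school.
    """
    shuffled = list(volunteer_schools)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return {"program": shuffled[:half], "control": shuffled[half:]}


def matrix_sample(students, tasks, rng):
    """Randomly give each student exactly one task (matrix sampling).

    Students are shuffled and then dealt to tasks in rotation, so the
    task groups differ in size by at most one student.
    """
    shuffled = list(students)
    rng.shuffle(shuffled)
    return {s: tasks[i % len(tasks)] for i, s in enumerate(shuffled)}


if __name__ == "__main__":
    rng = random.Random(1989)  # fixed seed so the draw can be audited
    print(assign_schools(["School A", "School B", "School C", "School D"], rng))
    prompts = ["personal narrative", "expository essay", "mystery story"]
    students = [f"student_{n}" for n in range(30)]
    assignment = matrix_sample(students, prompts, rng)
    print(sum(1 for t in assignment.values() if t == "mystery story"))
```

Because the evaluation center, not the developer, would run the draw, neither the choice of program schools nor the choice of which student writes which kind of essay could be steered toward a favorable result.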

Based on recommendations of the program developer, the evaluation 
center would prepare observation forms to assess degree of implementation 
and would send trained observers into project classrooms (and control
classes) to characterize the degree to which the program was being imple-
mented and the differences between experimental and control classrooms.
Also, evaluation center staff would carefully monitor the costs of implement-
ing the program in order to compute cost-effectiveness ratios at the end of the
evaluation process. 
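
One simple form such a ratio could take is sketched below. The definition used here (added cost per student divided by the program's achievement advantage over the control group) is an assumption made for illustration, as are the function name and the sample figures; the chapter does not specify a formula.

```python
def cost_effectiveness_ratio(added_cost_per_student, program_mean, control_mean):
    """Return the added cost per student for each point of achievement gained.

    Assumption: the effect is the difference between program and control
    posttest means, measured on the same scale. Returns None when the
    program shows no advantage, since the ratio is then not meaningful.
    """
    effect = program_mean - control_mean
    if effect <= 0:
        return None
    return added_cost_per_student / effect


if __name__ == "__main__":
    # Hypothetical figures: $400 added cost, 8-point advantage over control.
    print(cost_effectiveness_ratio(400.0, 58.0, 50.0))
```

A lower ratio means less money spent per point gained, which would allow programs of different cost and different effectiveness to be compared on a common footing.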

In most cases, program evaluations would be continued for two school
years, to examine (in the second year) the program's effects when it is fully 
implemented and no longer novel. 






Clearly, the evaluations conducted by the independent evaluation cen-
ter(s) would have immediate national visibility. Programs that held up under
the stringent conditions likely to be applied by these centers could be counted
on to make a difference with students, and this fact could have a considerable
impact on policy and practice. For example, Chapter I and Title I legislation
have always specified that school districts applying for funds must satisfy
their state departments of education that their programs are "of a sufficient
size, scope, and quality to give reasonable promise of substantial progress
toward meeting the educational needs of the children being served." The
national average effect of Chapter I on reading and mathematics achievement
is on the order of two to three percentile points at most (see Chapter 1 of this
book), hardly "substantial progress." If there were programs known to be
capable of providing "substantial progress," then there may be a legal basis
for requiring that school districts invest in such programs or demonstrate that
their existing programs are more effective. In any case, having a set of
proven, effective programs would certainly provide a basis for federal, state,
and local dissemination efforts focused on students at risk. 

Obviously, it makes most sense to have independent evaluation centers
funded by the federal government, as their findings would apply throughout
the country. However, states or even large districts could establish smaller-
scale evaluation centers of their own to systematically locate promising
programs and then rigorously evaluate them within the state or district. We
have established a center of this type at Johns Hopkins University to serve
the Baltimore City Schools. This center, called the Baltimore Public Educa-
tion Institute, is funded by the Abell Foundation, a local private foundation,
and has as one of its functions the task of bringing promising programs into
the Baltimore City Public Schools and evaluating them.



Fund Research and Development of New Programs

Most of the effective programs discussed in Chapters 2 through 4 are very
old, having been developed in the 1960s and 1970s. Some are no longer in
existence. One reason that few comprehensive programs have been devel-
oped in recent years is that since the Nixon administration, there has been
very little federal support for the kinds of systematic development and evalu-
ation of instructional programs that have the greatest promise for making
substantial differences in the achievement of at-risk students. If we are
serious in our undertaking to replace current remedial and special education
programs with proven, research-based alternatives, we must have a wide
range of alternatives from which districts may choose, both to allow for
adaptations to local circumstances or needs and to avoid putting governments
in the uncomfortable position of promoting a small number of programs. As
the earlier chapters in this book demonstrate, we have learned a great deal in
the past twenty years, but little of what we have learned has been translated
into practical, comprehensive, and replicable programs. For example, it is
ironic that much of the debate about the use of systematic direct instruction 
models to teach reading and mathematics to disadvantaged students still 
revolves around Distar, a program that is now older than many of the 
teachers using it. After three decades of advances in research on cognitive 
psychology, reading and mathematics education, and classroom organization 
and management, we can surely do better, yet the funds to undertake system- 
atic development work on the scale that would be required are nowhere on
the horizon. 

A major federal commitment is needed to fund promising development 
efforts specifically directed toward the needs of students at risk. It is not 
enough to state that we know "what works." We also need to know how to 
establish and maintain that which "works" in real classrooms. University-
and school-based developers could be funded over a period of years to 
develop, pilot test, refine, and evaluate programs that would ultimately be 
evaluated by the independent evaluation centers. Of course, not all programs 
would have to start from scratch; many promising programs now exist that 
are further along in the development-evaluation process. The idea would be 
to get the best minds in education (and related fields) working to solve a very 
well-defined objective of enormous importance— the prevention and remedi- 
ation of learning problems among students at risk. 

One requirement for the development effort would be that developers 
could be exempted from certain Chapter I or special education regulations, as 
long as they operate within rules that could, in principle, be enacted. For 
example, there was a recent debate in Congress about extending the opportu-
nity for high-poverty schools to opt for schoolwide projects, in which Chapter
I funds could be used to benefit all students. The problem is that there were
few examples of schoolwide projects, fewer coherent models appropriate for
use within schoolwide projects, and almost no rigorous evaluations of such
models. The discussions of changes in the legislation would have been greatly
enriched if research on coherent schoolwide models could have been under
way before the legislative changes were considered. 

In addition to providing major funding for researchers to develop and 
evaluate programs, it would be a good idea to provide smaller amounts of
funding and perhaps waivers of certain regulations to school districts that are
implementing innovative programs. In order to receive this funding, the
districts could be required to set up an adequate experimental design, perhaps
with the assistance or under the supervision of an independent evaluation
center. 

Again, although the federal government is the logical source for funding 
of development and evaluation activities, state, local, and private foundation 
funds may also be appropriate for this purpose.



Resources to Help Districts Adopt Effective Programs



Once we have a set of proven, effective models for students at risk, it will
next be necessary to provide a mechanism for implementing these programs
successfully under Chapter I or special education funding. These implemen-
tation issues must be taken very seriously; research on dissemination of
externally developed innovations finds that implementation must be planned
for and executed on a classroom-by-classroom basis. At a minimum, local
districts (or intermediate districts, when appropriate) would need to have
"circuit-riders" or even building-level coaches who are experts in a given
program and have full-time responsibility to ensure that the new program is
being adequately implemented by, among other things, visiting classes con-
stantly to respond to teachers' concerns and see that teachers are adequately
implementing the program. Providing release time for training, followup, and
preparation of materials is also critical. 

For most types of innovations, it is important to involve teachers in 
selecting the innovation and in participating in planning the phasing in of the 
program. For example, schools whose Chapter I programs have been ineffec-
tive in the past may be compelled to select a different program, but which
program is selected should be left up to a free and informed choice of the
professional staff. When appropriate, teachers may be allowed to choose
different programs within the same building, or the program may be phased in
over time, with volunteers serving as the first wave of implementers.

Providing training and support for staff developers at the local level 
could be done in many ways. State or regional Chapter I or special education
effectiveness centers might be established. These centers would employ
experts in the validated models who also are experts in dissemination of
innovations. Alternatively, the developers themselves might be funded to
provide training and followup to staff developers located within adopting
districts. 

The role of staff development in this conception would be vastly differ-
ent from current practice. The emphasis would be on sustained training and
on-site followup in a relatively small number of proven programs, not the 
one-day workshops in larger numbers of programs which characterize much 
of current staff development. 

Dissemination of effective programs could be funded by federal, state,
or local education agencies. One model for this might be the National Diffu-
sion Network (NDN), a federal program that funds developers to refine and
disseminate programs passed by JDRP. The NDN also funds a system of
state facilitators who work to bring these programs into their states.
However, adoption of the programs themselves is paid for by the districts.
The NDN system itself might be expanded to permit it to focus on effective
programs for students at risk, or a similar system might be developed within
Chapter I and/or special education. In addition, some proportion of Chapter
I or special education funds could be set aside to pay the costs of bringing in 
and maintaining validated programs. 

At present, both Chapter I and special education funding are as secure
as they have ever been. For example, the vote reauthorizing Chapter I was
nearly unanimous in both the House and the Senate. Certainly, efforts to 
maintain and expand funding for these programs must continue, but we must 
also turn our attention to ensuring that the programs purchased by these 
funds are the best they can be. We must undertake a major commitment to
expand the knowledge base underlying effective programs for students at 
risk, to develop and evaluate new programs, and to set up dissemination 
procedures to ensure high-quality implementation of effective programs at 
the school and classroom level. Without such a commitment, it is certain that 
remedial and special education programs will continue to produce the medio-
cre results they produce now. We can do much better. The tragic waste of the
learning potential of so many of our children is needless and cannot be 
allowed to continue. A long road lies ahead to make the needed changes, but
at the end of that road is a chance to transform the elementary school into an 
institution that guarantees all students an adequate level of basic skills. 